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Abstract 

For the whole class of linear term rewriting systems, we define bottom- 
up rewriting which is a restriction of the usual notion of rewriting. We 
show that bottom-up rewriting effectively inverse-preserves recognizability 
and analyze the complexity of the underlying construction. The Bottom- 
Up class (BU) is, by definition, the set of linear systems for which every 
derivation can be replaced by a bottom-up derivation. Membership to BU 
turns out to be undecidable, we are thus lead to define more restricted 
classes: the classes SBU(fc),fc £ N of Strongly Bottom-Up{k) systems for 
which we show that membership is decidable. We define the class of 
Strongly Bottom-Up systems by SBU — [J^.i-^SBU{k). We give a poly- 
nomial sufficient condition for a system to be in SBU. The class SBU 
contains (strictly) several classes of systems which were already known 
to inverse preserve recognizability: the inverse left-basic semi-Thue sys- 
tems (viewed as unary term rewriting systems), the linear growing term 
rewriting systems, the inverse Linear-Finite-Path-Ordering systems. 

Keywords: Term rewriting systems; Semi-Tliuc systems; Regularity preserva- 
tion; Accessibility problem. 
AMS classification: 68Q42,03D03,03D40 



1 Introduction 

General framework An important concept in rewriting is the notion of 
preservation of recognizability through rewriting. Each identification of a more 
general class of systems preserving recognizability, yields almost directly a new 
decidable call- by- need [11] class, decidability results for confluence, accessibil- 
ity, joinability. Also, recently, this notion has been used to prove termination of 
systems for which none of the already known termination techniques work [15]. 
Such a preservation property is also a tool for studying the recognizable/rational 
subsets of various monoids which are defined by a presentation {X, TV) , where 
X is a finite alphabet and TZ a Thue system (see for example [21, 22]). Conse- 
quently, the seek of new decidable classes of systems which preserve (or inverse 
preserve) recognizability is well motivated. 

Many such classes defined so far have been defined by imposing syntactical 
restrictions on the rewrite rules. For instance, in growing systems ([17, 23]) 
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variables at depth strictly greater than 1 in the left-handside of a rule cannot 
appear in the corresponding right-handside. Finite-path Overlapping systems 
[30] are also defined by syntactic restrictions on the system. The class of Finite- 
path Overlapping systems contains the class of growing systems [23]. Previous 
works on semi-Thue systems were also proving such a recognizability preserva- 
tion, under syntactic restrictions: cancellation systems [2], monadic systems [4], 
basic systems [1], and left-basic systems [26] (sec [28] for a survey). 

Other works establish that some strategies i.e. restrictions on the derivations 
rather than on the rules, ensure preservation of recognizability. Various such 
strategies were studied in [13], [25], [29]. 

We rather follow here this second approach: we define a new rewriting strat- 
egy which we call bottom-up rewriting for linear term rewriting systems. The 
bottom-up derivations are, intuitively, those derivations in which the rules are 
applied, roughly speaking, from the bottom of the term towards the top (this 
set of derivations contains strictly the bottom-up derivations of [25] and the 
one-pass leaf-started derivations of [13]). An important feature of this strat- 
egy, as opposed to the ones quoted above, is that it allows overlaps between 
successive applications of rules. A class of systems is naturally associated with 
this strategy: it consists of the systems TZ for which the binary relation — >^ 
coincides with its restriction to the bottom-up strategy. We call "bottom-up" 
such systems and denote by BU the set of all bottom-up systems. 

Overview of the paper Most of the results proved in this paper were an- 
nounced in [12], which can thus be considered as a medium-scale overview of 
this paper. Let us give here a large-scale overview, section by section, of the 
contents of the paper. 

In section 2, we have gathered all the necessary recalls and notation about 
words, terms, rewriting and automata. 

In Section 3, we define bottom-up rewriting for linear term rewriting systems 
using marking techniques. Wc first define bottom-up{k) derivations for /c € N 
(bu(fc) derivations for short) and the classes Bottom-up(fc) (BU(fc) for short) 
of linear systems which consists of those systems which admit bu(fc) rewriting, 
i.e. such that every derivation between two terms can be replaced by a bu(fc) 
derivation, and the Bottom-up class (BU) of bottom-up systems which is the 
infinite union of the BU(fc) (for k varying in N). 

In Section 4 we prove Theorem 4.2 which is the main result of the pa- 
per: bottom-up rewriting inverse-preserves recognizability. Our proof consists 
of a reduction to the preservation of recognizability by finite ground systems, 
shown in [5] , [9] . The proof is constructive i. e gives an algorithm for comput- 
ing an automaton recognizing the antecedents of a recognizable set of terms. 
We estimate the complexity of the algorithm: a separate tight upper-bound is 
given for BU~(1) semi-Thue systems; another upper-bound is given for BU (1) 
term rewriting systems; finally, a general upper-bound is given for BU~(A;) term 
rewriting systems. We then give a lower bound for BU~(1) term rewriting sys- 
tems showing that some of our upper-bounds cannot be easily improved. 
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In Section 5, we show that BU contains all the classes of semi-Thue systems 
quoted above (once translated into term rewriting systems in which all symbols 
have arity or 1), and also the linear growing systems of [17]. We study the 
decidability of membership to the BU(fc) classes. Wc show that membership to 
BU(/c) is undecidable for fc > 1 even for semi-Thue systems. 

In Section 6, we define the restricted class of strongly hottom-up{k) systems 
(SBU(fc)) for which we show decidable membership. We define the class of 
strongly bottom-up systems SBU = [J^.^jq SBU(fc). Based on the results of [19] it 
seems likely that the property [3k > such that TZ G SBU(fc)] (so membership 
to SBU) is undecidable. We give a polynomial sufficient condition for a system 
to be in SBU. We finally show that LFPO"^ C SBU. 
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2 Preliminaries 

This section is mostly devoted to recalling some classical notions and making 
precise our notation. The reader is referred to [6] for more details on the subject 
of tree-automata and to [18] for term rewriting. 

2.1 Sets, binary relations 

Given a set E, we denote by ViE) its powerset i.e. the set of all its subsets. 
For every sets E^ F and every binary relation — >C E x F, and every subsets 
E' C E,F' C F, we denote by E' F' the fact that 3e e E' , 3f e F',e^ f. 
We sometimes abusively note e ^ F for what should be written {e} F. 
The inverse binary relation — is defined by 

V/ eF,\feeE,f -^-'^ e ^ e ^ /. 

Let E, F be two sets endowed with binary relations -^eQ E x E, F x F. 

Definition 2.1. A binary relation R C E x F is called a simulation of the 
structure {E,^e) by the structure {F,^f) iff 

Vei,e2 e ^,V/i G F, [((ei ->b e2)A(eii?/i)) ^ (3/2 £ F, (/i -^p f2)f\{e2Rf2))] 

This is essentially the classical notion of simulation defined in [24] , excepted 
that we do not impose on R to be everywhere defined. 

2.2 Words and Terms 

A finite word over an alphabet A is a map u : [0, ^ — 1] A for some ^ G N. 
The integer I is the length of the word u and is denoted by | The set of words 
over A is denoted by A* and endowed with the usual concatenation operation 
u^v & A* ^ u ■ V & A* . The empty word is denoted by e. A word w is a prefix 
of a word v iff there exists some w £ A* such that v — uw. We then note 
v\u :— w. We denote hy u < v the fact that u is a prefix of v and by u <iex v 
the fact that u is lexicographically smaller than (or equal to) v. 

Given w € A* \ {e}, we denote by last(u') the last (i.e. rightmost) letter of 

w. 

We call signature a set of symbols T with fixed arity ar : ^ N. The subset 
of symbols of arity m is denoted by J-m- 



4 



As usual, a set P C N* is called a tree-domain (or, domain, for short) iff , 
for every u S N* , z G N 

{u ■ i e P ^ u e P) k {u ■ {i + 1) e P =^ u ■ i € P). 

We call P' C P a. subdomain of P iff, P' is a domain and, for every u G P, i G N 

{u-ieP' ku - + e P)^u-{i + i)e P'. 

A chain of P is a subset C C P which is linearly ordered by ^. A subset 
i? C P is called a branch of T iff it is a chain, which is maximal for inclusion. 
A subset P' C P is called a pai/i of T iff it is a chain, which is an interval i.e.: 
\fx, z G P' ,yy (^y^P, x<y<z^y£ P' . An antichain of P is a subset 
^ C P such that, for every u, it' G A, u ^ u' => u = u' . We often denote a 
finite antichain by the sequence of its elements in increasing lexicographic order. 
We sometimes do not distinguish between the antichain and this sequence. A 
subset r C P is called a transversal of T iff it is an antichain, which is maximal 
for inclusion. The ordering < is extended to transversals in the following way: 
T <T' iff, Vu G T, 3u' G T', u ^ u' . Given Q C P, the closure of Q in the tree- 
domain P; denoted CL(Q, P), is the smallest superset of Q which is a subdomain 
of P. 

Lemma 2.2. Let P C N* be a tree domain. Let Y C P be an antichain. There 
exists a transversal Z of P such that 

1- Y C Z 

2- for every transversal TofP,Y<ZT^Z^T 

3- \lz & Z,\lv & P,3y & Y,{v < z ^ V -<y). 

Sketch of proof. Let 
Z := {z e P \ 3y eY,3u e P,3a en,u < y k z = u ■ a k {Wy' &Y,z y')}. 

This set Z fulfills points (1)(2)(3). □ 

After Lemma 2.2, we denote by Tr(F, P) the transversal Z determined by Y 
and P and we call it the smallest transversal containing the antichain Y of the 
tree-domain P. 

A (first-order) term on a signature J- is a partial map t : N* ^ J- whose domain 
is a tree-domain and which respects the arities. We denote by T{T, V) the set 
of first-order terms built upon the signature J- UV, where .F is a denumerable 
signature and V is a denumerable set of variables of arity 0. 

The domain of t is also called its set of positions and denoted by Vos{t). The 
set of variable positions (resp. non variable positions) of a term t is denoted by 
Po5v{t) (resp. Posy(i)). The set of leaves of t is the set of positions u G Pos{t) 
such that u • N n Pos(t) = 0. It is denoted by £v(i). The set of internal nodes 
of t is the set of positions u G Pos(f) such that u ■ N n Pos(t) 0. It is denoted 
by In{t). We write Vos+{t) for Pos(i) \ {e}. If u,v G Pos(t) and u ^ v, 
we say that u is an ancestor of v in t. Given v G Pos'*'(t), its father is the 
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position u such that v = uw and \w\ = 1. The depth of a term t is defined by: 
dpt{t) := sup{|ti| I u G Vos-^{t)} + 1. Given a term t and u G ■pos(t) the subterm 
of t at u is denoted by </m and defined by Vos{t/u) = {w \ uw S 'Pos(i)} and 
Vw 6 'Pos{t/u), t/u{w) = t{uw). A term s is a prefix of the term f iff there 
exists a substitution a such that scr = t. Given a hnear term t G T(JF, V), 
a; G Var(t), we shall denote by pos(t, x) the position of x int. A term containing 
no variable is called ground. The set of ground terms is abbreviated to T{J-) or 
T whenever T is understood. A term which does not contain twice the same 
variable is called linear. 

Among all the variables, there is a special one designated by □. A term 
containing exactly one occurrence of □ is called a context. We denote by Ci {J-) 
the set of all contexts over T. 

A context is usually denoted as C[]. If u is the position of □ in C[], C[t] denotes 
the term C[] where t has been substituted at position u. We also denote by 
C\\u such a context and by C[t]u the result of the substitution. A term s is a 
factor of the term t iff there exists a context CQ^ and a substitution a such that 
t = C[sa\. In this case, we call occurrence of s in t the subset of Vos{t) which 
corresponds to the non- variable positions of s i.e. u7^0Sy(.s). 

We denote by |t| := Card(7'os(t)) the size of a term t. 
Two terms t, t' G ^{^F, V) are called a-equivalent iff, there exists a substitution 
(T : V ^ V which is a permutation of the set V, and such that ta = t' . In this 
case we note t =a t' . 

2.3 Semi-Thue systems 

Let A be a set that we take as alphabet. A rewrite rule over the alphabet A 
is a pair u — v of words in A*. We call u (rcsp. v) the left-handside (rcsp. 
right-handside) of the rule {Ihs and rhs for short). A semi-Thue system is a 
pair (S, A) where A is an alphabet and S a set of rewrite rules built upon the 
alphabet A. When A is clear from the context or contains exactly the symbols 
in S, we may omit A and write simply 5*. We call size of the set of rules S 
the number ||5|| := J2u-^v£S I*"! ~^ i^l- '^^^ one-step derivation generated by S 
(which is denoted by ^s) is defined by: for every /, g G A* , f — >5 g iff there 
exists u — s- w G 5 and a, /? G A* such that / = au/3 and g = av(3. The relation 
(the derivation generated by S) is the rcfiexivc and transitive closure of the 
relation -^s- The semi-Thue system {S,A) is called length-increasing iff, for 
every u v E S, \u\ < \v\. 

2.4 Term rewriting systems 

A rewrite rule built upon the signature ^ is a pair / — > r of terms in T{ J-, V) 
which satisfy Var(r) C Var{l). We call I (resp. r) the left-handside (resp. right- 
handside) of the rule (Ihs and rhs for short). A rule is linear if both its left and 
right-handsidcs are linear. A rule is left-linear if its left-handside is linear. 

A term rewriting system {system for short) is a pair {TZ,J-) where J- is 
a signature and TZ a set of rewrite rules built upon the signature J-. When 
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T is clear from the context or contains exactly the symbols of 7?., we may 
omit T and write simply Tl.. We call size of the set of rules 7?, the number 
I^ll ■= S(^r6TC 1^1 + Rewriting is defined as usual: for every t,t' G T{T,V), 
t -^Tz t' means that there exists C e Ci{T U V),l r £ U, a : V ^ T{T,V) 
such that 

t^C[la], t'^C[ra]. (1) 

Let us fix some one-step derivation (1) and denote by u the position of the hole 
in C. Let v G Vos{t),v' e Vos{t'). We call v' a residue of tj, w.r.t. the one-step 
derivation (1), iff 

- either there exists x E Var(Z) n Var{r),w G Pos{xa),vi G Vos{l),v[ G Vos{r) 
such that 

l{vi) = X, r{v[) = x,v = uviw, v' — uv[w, 

- or 

V = v' ,v E Vos{C), u ^ V. 

This notion extends to factors in the following way: Let s be a factor of the 
term t (resp. t'). An occurrence p' of s in i' a residue of an occurrence p of s in 
t, w.r.t. the one-step derivation (1), iff 

- either there exists x G Var(Z) n Var{r),w G Pos{xa),Vi G Vos{l),v[ G Vos{r) 
such that 

l{vi) = x,r{v[) = x,p = uviVoSy{s),p' ~ uv'{Pos^{s), 

- or 

P = p' CPos^(C). 

Given a derivation 

D : tt) — »7j ti -^Ti ■ ■ - ti -^Ti ti+i ~^Ti • ■ - tn 

and V G Vos{tQ),v' G 7'os(t„), we call v' a residue of v, w.r.t. derivation D, 
iff, there exists positions Vi G Vositi) for < i < ri such that, ij = for 
every i G [0,n — 1], Ui+i is a residue of w.r.t. the i-th step of derivation D 
and Vn ~ v' . Similarly, if p C 7^05(^0 ) is an occurrence of a term s in Iq and 
p' C 'Pos{tn) is an occurrence of the same term s in t„, we call p' a residue of 
p, w.r.t. derivation £>, iff, there exists occurrences pi C Vositi) of s in for 
< z < n such that, p = po, for every i G [0, n — 1], Pi+i is a residue of p,; w.r.t. 
the i-th step of derivation D and p„ = p' ■ A system is linear (resp. left-linear) 
if each of its rules is linear (resp. left-linear). A system TZ is growing [17] if 
every variable of a right-handside is at depth at most 1 in the corresponding 
left-handside. 

2.5 Words viewed as Terms 

In order to transfer every definition (or statement) about Term Rewriting Sys- 
tems into a similar one about Semi-Thue systems, we define precisely here an 
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embedding of the set of words (resp. semi-Thue systems) over an alphabet A 
into the set of terms (resp. Term Rewriting Systems) over some signature J-. 
Let A be some alphabet. We define the signature ^{A) by 

TiA) A U {#}, Va e A, ar{a) = 1 and ar(#) = 0. 

We define two mappings Ti : A* T{!F{A), {x}) {i G {0, 1}) by introducing a 
variable x and setting: 

Ti{e) = x, Ti{aia2 ■ ■ - an) = ai(a2(. . . (a„(a;)) . . .)), 

•^o(e) = #, ■^o(aia2 • --an) = ai(a2(. . . (a„(#)) . . .)), 
Wc associate with every rewriting rule u ^ v , the (term) rewriting rule 

!F{u v) J-i{u) J-'i(v), 

and with every semi-Thue system {S, A) the term-rewriting system 

{T{S),T{A)) where T{S) := {J^(m v) \ u v e S}. 

The following lemma is straightforward 

Lemma 2.3. Let (5*,^) be a semi-Thue system and w,w' £ A*. Then 
w^s w' ^ To{w) ->r(s) •^o(w') <^ Ti{w) -^j^(s) ^i{w'). 

In the sequel wc shall often omit some parentheses in the unary terms: liw = 
0.10,2 ■ ■ ■ o„ where Oi € A and t G T{J-{A), {a;}), the term ai(a2(. . . (a„(x)) . . .))[t/a;] 
will be abbreviated as w{t). 

2.6 Automata 

Wc shall consider bottom-up term (tree) automata only [6] (which wc abbreviate 
to f.t.a ). An automaton A is given by a 4-tuple {J^,Q,Qf,r) where T is the 
signature, Q is a set of symbols of arity 0, called the set of states, Qf is the set 
of final states, F is the set of transitions. Every element of F has the form 

q^r (2) 

for some 5, r G Q, or 

f{qi,...,q„i) ^ q (3) 

for some m > 0, / G Tm, 9i, • ■ • , (Zm G <3- The size of A is defined by: ||^|| := 
Card(F) -|- Card((5). The set of rules F can be viewed as a rewriting system over 
the signature !F LI Q. We then denote by — >r or by — s-^ (resp. by or by 
— >^) the one-step rewriting relation (resp. the rewriting relation) generated by 
F. 

Given an automaton A, the set of terms accepted by A is defined by: 
L{A) ■.= {teT{T) I 3qeQf,t^\ q}. 
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A set of terms T is recognizable if there exists a term automaton A such that 
T = L{A). 

The automaton A is called deterministic iff for every i, u, u' G T{J- U Q), 

^ w e r & t ^ u' e r) ^ (u = u')- 

The automaton A is called complete iff for every m > 0, f £ J-m and m-tuple 
of states (qi, . . . ,(7m) G Q™, either (m = and f £ Q) or, there exists q £ Q 
such that 

■ ■■,*«) q£T. 

Beside the above usual properties we introduce here the notion of standard 
automaton as follows: 

Definition 2.4. A n.f.t.a. A — {J-,Q,Qf,r) is ca/fed standard ijf 

1- ToCQ 

2- every rule (3) of A is such that to > 1 

3- for every m > 1, f £ Tm^ qi, . . . ,qm £ Q there exists a unique q £ Q such 
that f{qi, . . .,qrn) -^A 9- 

We give later on, in §4.2.2, a precise procedure transforming any n.f.t.a. A 
into a standard f.t.a. A with "similar" rewriting relation, hence recognizing the 
same language. 

2.7 Automata and rewriting 

Given a system TZ and a set of terms T, we define 

{-^n)\T] = {s e T{T) I s t for some t £ T} (4) 

and 

[T] {-^n) = {-^ e r{T) \t^*T^s for some t £ T] (5) 

A system TZ is recognizability preserving if \T\{-^'^) is recognizable for every 
recognizable T. 

A system TZ is inverse recognizability preserving if is recognizable 

for every recognizable T or equivalently if TZ~^ is recognizability preserving. 

Some technical notions The following lemma extends the property of de- 
terminism to tree-domains larger than just a single point. 

Lemma 2.5. Let A be some standard f.t.a over the signature T . Let t, ti,t2 £ 
T{TUQ). Ift -^*^ ti,t^\ t2 and Vos{ti) = Vos{t2), then ti ta- 

We extend to subdomains the usual notion of state reached by some deter- 
ministic complete f.t.a from a given term t: we call it the reduct of t over the 
subdomain P. 
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Definition 2.6 (^-reduct). Let A be some standard f.t.a over the signature 
T. Let t G T(jF U Q) and let P be some subdomain ofVos{t). We define 
Red(t,P) = t' as the unique element ofT(J^UQ) such that 

1- Vos{t') = P 

2- t t' 

The existence and unicity of such a term Red(t, P) follows from the technical 
conditions imposed by Definition 2.4. 

Lemma 2.7. Let A be some standard f.t.a over the signature T . Let t, ti, t2 G 
T{F\JQ). Ift -^*^ ti,t^\ t2 and 7'os(ti) C 'Pos{t2), then t2 ii. 

Proof. Since i ti, t t2 Red(t2, 7'os(ii)) andPos(ii) = Pos(Red(t2, 7^os(ii)), 
by Lemma 2.5, ti = Red{t2,Vos{ti)) which implies that t2 — ti. □ 

3 Bottom-up rewriting 

In order to define bottom-up rewriting, we need some marking tools. In the 
following we assume that is a signature. We shall illustrate many of our 
definitions with the following system {TZi , T) 

Example 3.1. 7^l = {f (x) g(x), g(h(x)) i(x), i(x) a}, T = {a, f, g, h, i} 
with ar(a) = 0, ar(f) = 1, ar(g) = 1, ar(h) = 1, ar(i) = 1. 

3.1 Marking 

As in [14], we may mark the symbols of a term using natural integers. 
Marked symbols 

Definition 3.2. We define the (infinite) signature of marked symbols; 

T^ = {r\feT,ieN}. 

For every integer k > we note: T-^ = {/* \ fGJ-,0<i<k}. The mapping 
m : J-^ — > N maps every marked symbol into its mark: m(/*) = i. 

Marked terms 

Definition 3.3. The terms in T{J-'^, V) are called marked terms. 

The mapping m is extended to marked terms by: 
if t G V, vn{t) = 0, otherwise, m{t) = m(root(i)). 

For every f eT,we identify f° and /; it follows that J' C J^'^, T{T) C T{J^^^) 
and T{T,V) cT{T^,V). 

Example. m{a^) = 2, m(i(a2)) = 0, m{h\a)) = 1, 
m{h^{x)) = l,m(a;) =0. 
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Definition 3.4. Given t E T{!F^\V) and i eN, we define the marked term f 
whose marks are all equal to i: 

if t is a variable x t^ = x 

if t is a constant c = 

otherwise {t = f{ti, . . . ,tn))where n>l ~ ■ • ■ ,tn^) 

This marking extends to sets of terms 5* (S** — {t^ \ t E S}) and substitutions 
a {a'^ : X !—>■ {xaY). 

We use mmax(t) (resp. mmin(i)) to denote tlie maximal (resp. minimal) 
mark of a marked term t. 

mmax(<) = max{m(t/w) | u E Pos{t)} 
mmin(t) = min{m(i/-u) | u E 'Pos{t)} 

For u E 7^05+ (<), mmax<"(t) = max{m(t/w) \ v < u}. 

Example. mmax(i(a2)) = 2, mmin(i(a2)) = 0, mmax<i i(g(hi(a2))) = 1. 

Notation: in the sequel, given a term t E T{J-, V), t will always refer to a term 
of T{!F^,V) such that t = t. The same rule will apply to substitutions and 
contexts. 

Finite automata and marked terms. Given a finite tree-automaton A = 
{J^, Q,Qf,r) we extend it over the signature T-'^, by setting 

:= {(/^(gf ,...,qi-)^q^) \ {f{q,, ...,q,,^q)E F, j, ji, . . . , j„ G [0, fc]}, 

and 

Since, for every integers k, k' , A-'' and A-'' have the same action on terms 
with marks not greater than min(fc,A:'), we often denote by A any extension 
A-'^ with a sufficiently large k w.r.t. the terms under consideration. 

N acts on marked terms. We define a right-action of the monoid (N, max, 0) 
over the set J-'^'^ which just consists in applying the operation max on every mark: 
for every t E J-^'^, n E N, 

Pos{tQn) := Pos{t), Vit E 7'os(i), m((i0n)/it) := max(m(t/u), n), (tQnf = tP 

Since a marked term can be viewed as a map from its domain to the direct 
product X N, and since the operation acts on the second component only 
while every f.t.a acts on the first component only, the following statement is 
straightforward . 

Lemma 3.5. Let A he some finite tree automaton over T, s,t E T(J-"^) and 
heN. Ifs ~^*^ i then (s n) (tQn). 
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Marked rewriting 

We define here the rewrite relation o— » between marked terms. For every hnear 
marked term t S T{!F^,V) and variable x £ Var(F), we define: 



M(t, x) ^ svip{m{t/w) I w < pos(t, x)} + 1. 



(6) 



Let TZ he a left-linear system, s G T{T^'^) and t <E T. Let us suppose that 
s G T(jr'^) decomposes as 



'=€[1^]^, with (/,r)e7^, 



(7) 



for some marked context C[]« and substitution cr. We define a new marked 
substitution a (such that ct" = by: for every x £ Var(r), 



(xct) 0M(C[7],a;). 



(8) 



Wc then write s o— > i where 



(9) 



(This is illustrated by Figure 1, where M denotes M{C[l],x) and the marks 
are noted between brackets (. . .)). More precisely, an ordered pair of marked 



s 




t 


C 




C 




V 





I 

{m{s/w)) 
xa 

(k) 



(0> 



(max(A:, M))) 



Figure 1: A marked rewriting step 

terms {s,t) is linked by the relation o— > iff, there exists C[]^, (Z, r),I, ct and 
a fulfilling equations (7-9). The intuitive idea behind the above definition is 
that the marks are storing the relevant information concerning the ordering of 
successive positions of redoxes during the derivation. A mark k will roughly 
mean that there were k successive applications of rules, each one with a leaf 
of the left-handside at a position strictly greater than a leaf of the previous 
right-handsidc. 

The map s s- s*^ (from marked terms to unmarked terms) extends into a 
map from marked derivations to unmarked derivations: every 

So = C'o[IoCTo]i,o C'o[roCTo].i;o = si . . . C„_i[r„_iCT„_i]„„_i = s„ 

(10) 
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is mapped to the derivation 

^0 ~ Coi^oColuo ^ Coi^oCol'Uo = Si ^ ■ ■ ■ ^ Cji-1 [''ri-lO'„_i]^„_i = s„. (11) 

The context Ci[]t,., the rule (li,ri), the marked version li of li and the substi- 
tution ffi completely determine Si^i. Thus, for every fixed pair (sq, sq), this 
map is a bijection from the set of derivations (10) starting from sq, to the set 
of derivations (11) starting from sq. 

Example 3.6. With the system TZ of Example 3.1 we get the following marked 
derivation: 

f(h(f(h(a)))) f(h(g(hi(ai)))) f(h(i(a2))) f(h(a) 
g(hi(ai)) o^i(a2) o^a 

From now on, each time we deal with a derivation s — >* t between two terms 
s,t € T{!F^,V), we may implicitly decompose it as (11) where n is the length 
of the derivation, s = sq and t = Sn- 



3.2 Bottom-up derivations 

Definition 3.7. The marked derivation (10) is weakly bottom-up if, for every 
<i <n, 

kiV m(h) = 0, (12) 

h eV => sup{m(s7/u) \ u < Vi} ^ 0. (13) 

(Handling the case where some Ihs are just variables is worthwhile: for ex- 
ample the systems of [2], when viewed as term rewriting systems, have all their 
Ihs in V). 

Definition 3.8. The derivation (11) is weakly bottom-up if the corresponding 
marked derivation (10) starting on the same term s ~ s is weakly bottom-up 
(following the above definition). 



Remark 3.9. An alternative formulation for defining a weakly bottom-up deriva- 
tion is to say that no redex Ijffj is contracted at a position Vj strictly greater 
than a variable of a previous r^. This means, in some sense, that the reductions 
are made in a "bottom-up" fashion, see Figure 2 and Figure 3. 

We shall abbreviate "weakly bottom- up" to wbu. Note that the notion 
of wbu marked derivation is defined step by step. It is thus clear that the 
composition of two wbu marked derivations is wbu too. This might be false for 
wbu unmarked derivations. In the following we thus mainly manipulate marked 
wbu derivations. 

The next lemma shows that in the case of a linear system, a derivation can 
always be replaced by a wbu-derivation. 
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Figure 2: A wbu derivation 




Figure 3: A non wbu derivation 



Lemma 3.10. Let TZ be a linear system. If s — >^ t then there exists a wbu- 
derivation between s and t. 

Sketch of proof. We prove by induction on the integer n, that, for every deriva- 
tion s t, there exists a wbu-derivation from s to t, with the same length n 
and reducing the same redexes of s 
Basis:n = 

then s = t] the empty derivation is wbu. 
Induction stepiTi > 

As TZ is linear, every redex may have at most one descendant in each term of 
the derivation. We choose a maximal redex la {I ^ r E TZ) s among the set 
of redexes contracted somewhere in the derivation s — >" t; let u be the position 
of this maximal redex in s. A new derivation can be obtained by transferring 
the contraction of la at the beginning of the derivation: we obtain a derivation 
of equal length 

s = C[laU ^ C[raU t. (14) 

By induction hypothesis, the derivation C[rCT]„ — t can be made wbu while 
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preserving its length n — 1 and the set of redexes of C[r(T]„ that are contracted. 
Let us consider the unique marked derivation associated to (14): 



s = C[la]u C[ra]u 0^""^ t. (15) 

and the unique marked derivation associated to the wbu-dcrivation C[rCT]„ -^"^^ 
t: 

C[ra]u o^"-i t. (16) 



By assumption and preservation of the redexes, a does not contain any redex 
which is contracted inside the derivation C[ra\ — t. Hence, the [j + l)th 
step of derivation (15) uses a Ihs with a root that possesses the same mark as 
the root of the Ihs of the jth step of derivation (16). Since this mark is always 
null in (16), it is also null in (15). This shows that (14) is wbu. □ 

Definition 3.11. A marked term s is said m-incrcasing iff, for every u,v € 
Vo5{s),u :< V ^ m(s/u) < m{s/v). 

Lemma 3.12. Suppose that's is a m-increasing marked term, t E T{!F^ ,V)\V , 
m(t) ~ 0, CWv is a marked context, W is a marked substitution and s = C\i'a]y. 
Then, C'Wv has no mark above the position v. 

Proof. Let u £ Vos[C) such that u <v and C{v) = □. 
Since C|Ict] is m-increasing, 

m(C[iCT]/M) < m(C'^a]/v) 

But m(C\imv) = m(i) = 0. □ 

Lemma 3.13. Let s o t be a wbu marked derivation-step between s,t S 
T{!F^). If's is m-increasing, thent is m-increasing too. 

Proof. Suppose s is m-increasing and that s, t fulfil (7-9). Let us consider vi,V2 € 
Vos{t) such that vi :<V2. Let us show that 

m(t/vi) < m{t/v2). (17) 

We distinguish 3 cases depending on the relative positions of v,vi,V2 . 
Case 1: vi ^ v. 

Since the derivation step is wbu, we have m(s/w) = 0. According to the definition 
of a marked derivation-step, we have m{s/vi) — m{t/vi). Moreover, s is m- 
incrcasing. Hence, m{s/v) = m(s/t>i) = < m{t/v2). 
Case 2: v ^vi. 

If vi E V ■ Vos{r) then we have m(t/vi) = < m{t/v2). Otherwise, vi = 
V ■ w ■ wi,V2 = V • w • W2, where x is the label of w in r and wi, W2 £ Vos{xa). 
Since s is m-increasing, m{s/vi) < m{'s/v2), hence 

max(m(s/-!;i), M(C[7],x)) < max(m(s/t;2), M(C[I], x)) 
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i.e. m{t/vi) < m{t/v2)- 
Case 3: vi _L v. 

In this case we also have V2 -L v. It follows that for every i E {1, 2}, m(s/wi) = 
m(t/vi), and we can conclude as in case 1. 

In all cases we have established that (17) holds. □ 

The last Lemma generalizes to a sequence. 

Lemma 3.14. Let s o— >* t be a wbu marked derivation between s,t E T{J^^). 
Ifs is m-increasing, then t is m-increasing too. 

Proof. Straightforward induction on the length n of the derivation based on 
Lemma 3.13. □ 

Remark 3.15. Let us examine the value of M(C[I], a;) when s' o —>■* s = 
C[Ict] o-^ C[ra] — t, and s' — s is wbu; 

- if C is the empty context and I ~ x then M(C[I], x) = 1 

- otherwise, by Lemma 3.14, M(C\l],x) = m(C[I]//a;) + 1, where fx is the father 
of pos(C[I], x). 

We classify the derivations according to the maximal value of the marks. We 
abbreviate "bottom-up" to bu. 

Definition 3.16. A derivation is bu(fc) (resp. bu~{k)) if it is wbu and, in 
the corresponding marked derivation Vi G [0, n], mmax(si) < k (resp. Vi £ 
[0, n], mmax(Zi) < k). 

Let us introduce a convenient notation. 

Definition 3.17. Let k > 1. The binary relation fco— >^ over T{J-'^'^) is defined 
by: 

s fco— >^ t if and only if there exists a wbu marked derivation from s to t where 
all the marks belong to [0, k]. 

The binary relations fc— >^ over T{T) is defined by: 

s k^Tz t if and only if there exists a bu(fc) derivation from s to t. 

Example 3.18. For the system TZq = {^{H^)) ^ ^} with the signature T = 
{a("),f(i)}, although we may get a hii{k) -derivation for a term of the form 
f (. . . f (a) . . .) with fc + 1 f symbols: 

f(f(f(f(a)))) f(fi(f\ai))) f(f2(a2)) o-. f(a3) 

we can always achieve a bu(l) derivation: 

f(f(f(f(a)))) f(f(f(ai))) f(f(ai)) f(ai) 
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3.3 Bottom-up systems 

We introduce here a hierarchy of classes of rewriting systems and show that 
several well-known classes of rewriting systems are included in the low levels 
of this hierarchy: namely the right-ground systems, the left-basic semi-Thue 
systems, the linear shallow systems and the linear growing systems. 

Definition 3.19. Let P be some property of derivations w.r.t. Term Rewriting 
Systems. 

1- A Term Rewriting System {TZ,J-) is called P if for every s,i G such 
that s t there exists a P-derivation from s to t. 

2- A semi-Thue system (S*, A) is called P if the Term Rewriting System {J-{S), J- [A)) 
is called P. 

Wc denote by BU(fc) the class of BU(fc) systems, by BU~(fc) the class of 
BU~(fc) systems. We define the class of bottom-up systems, denoted BU, by: 

BU = y BU(fc) 

fcGN 

Lemma 3.20. Let k > 0. BU{k - 1) C BU"(fc) C BU(fc). 
Lemma 3.21. Every right-ground system is BU(0). 

Proof. The right-handsides being ground no mark (> 0) is ever introduced by 
o^. □ 

Lemma 3.22. Every inverse of a left-basic semi-Thue system is BU^(l). 

Proof. Let {S, A) be a semi-Thue system such that S^^ is left-basic. The com- 
binatorial restrictions defining the property "left-basic" (see, for example, con- 
ditions C1,C2 of section 2.5 in [28]) imply that, in every marked wbu-derivation 

w# o^>(s) au/3# o-'j^is) 

with w e A*,u ^ V e S,a,(3 e {A^)* , we must have 

mmax(?l) = 0. 

Hence TiS) G BU"(1), so that, by point 2 of Definition 3.19, S G BU"(1). □ 
Lemma 3.23. Every shallow system is BU^(l). 

Proof. Let 7?. be a shallow set of rules. This means that every rule I ^ r ^ TZ 
is such that / is a variable or all the occurrences of variables in I have depth 1. 
Let us consider a wbu-dcrivation of the form (10) starting on some unmarked 
term s. Every marked term li either has no mark (because it has depth 0) or 
has only one mark above each occurrence of variable: the mark of the root of li. 
In this case m{li) = 0, by Definition 3.7 and because, by Lemma 3.14, Cilkai] 
is m-increasing. Hence Definition 3.16 is fulfilled by the given derivation. □ 
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Lemma 3.24. Every growing linear system is BU(1). 

Proof. Let 7?. be a linear growing system over a signature and G T{J-). 
We prove by induction on the integer n that: if s = sq o— > . . . s„ — t is 
a wbu-derivation then in the corresponding marked derivation , Vi, < i < n, 
mmax(s7) < 1. 

If n = then mmax(so) — because sq = sq = s G 
Otherwise, we may write: 

So — > • • ■ ^ Sn ^ S„_|_i — I 

The induction hypothesis yields Vi, < j < n, mmax(s7) < 1- 

From mmax(s^) < 1, we get mmax(C„) < 1 and mmax(xavi) < IjVx e Var(^„). 
We also have mmax(r„) = 0. 

If Var(r„) = then s„+i = C„[r„] and mmax(s„+i) < 1. 

If Var(r,i) 7^ 0: Vx G Var(r,i), xovT = (a^ovT)* with i = TOaa;(mmax<"^^„) + 
1, mmax(xavr)) by definition of o^. In addition x is at depth 1 in In 
because TZ is growing, hence mmax<"^(^„) = m(^„). We have m(7„) = 
because the derivation is wbu. So i < 1 and m(x5vr) < 1- 
From mmax(C„) < 1, mmax(r„) = and Va; G Var(r„), mmax(xavi) < 1, 
we get that mmax(a;oVi') < 1- 

We conclude that mmax(C„[r„avr]) < 1 so that mmax(s„+i) < 1. 

□ 

Example 3.25. 

The system TZq ~ {f(f(x)) x} G BU~(1) and TZo is not growing. 
The system TZi of Example 3.1 belongs to BU~(2) and TZi is not growing. 
The systemTZ2 ~ {H^) ~* §(2^)7 h(g(a)) —>■ a} is growing and belongs to BU^(l). 
The system TZ^ = {f{x) g(a;), g(h(a;)) a} is growing and belongs to BU(1). 

Corollary 3.26. LinearGrowing C BU(1). 

4 Inverse-preservation of recognizability 

Let us recall the following classical result about ground rewriting systems 

Theorem 4.1 ([5]). Every ground system is inverse-recognizability preserving. 

This theorem was further refined and extended in [10, 9, 8], see [6] for an 
exposition. The main theorem of this section (and of the paper) is the following 
extension of Theorem 4.1 to bu(fc) derivations of linear rewriting systems 

Theorem 4.2. Let TZ be some linear rewriting system over the signature T , let 
T be some recognizable subset ofT{J-) and let k >0. Then, the set ( fc— 
is recognizable too. 
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4.1 Basic construction 



In order to prove Theorem 4.2 we are to introduce some technical definitions, 
and to prove some technical lemmas. Let us fix, from now on and until the end 
of the subsection, a linear system {TZ, T), a language T C T{T) recognized by a 
finite automaton over the extended signature J^U {□}, A = (JFU {□}, Q, Q/, F) 
and an integer > 0. In order to make the proofs easier, we assume in this 
subsection that: 

V/^re7^,^^V, (18) 

A is standard. (19) 

We postpone to §4.2 the proof that these restrictions arc not a loss of generality. 
Let us define the integer 

d := max{dpt{l) | ? ^ r £ 7^}. 

Definition 4.3 (Top domain of a term). Lett^ r{{TijQ)-^,{U}). We define 
the top domain oft, denoted by Topd(t) as: u G Topd(t) iff 

1- u£ Vosit) 

2- Vui,U2 G N* such that u = ui ■ U2, either m(t/ui) = or |u2| < (fc + 1 — 
m(?/ui))d. 

Wc then define the top of a term i, which is, intuitively, the only part of 
t which can be used in a bu(fc)-derivation starting on t. Everything below this 
part is merely included in the substitutions used by the derivation-steps, and 
thus copied. Such copied parts of t can be handled just by a state of the f.t.a. 
A. 

Definition 4.4 (Top of a term). For every t e T((J^ U Q)-'', {□}), Top(?) = 
Red(t,Topd(I)). 

This definition extends naturally, in a pointwisc manner, to substitutions. 

Lemma 4.5 (Top is morphic). Let C[] he a context with no mark above the 
symbol □ and let t be any marked term in T{{T U Q)-''). Then Top(C[<]) = 
Top(C)[Top(t)]. 

The proof is easy and therefore omitted. 

Lemma 4.6 (Top preserves unmarked terms). If t E T{!FU Q) and ct : V ^ 
T{{T U Q)f then Top(tW) = tTop(a). 

Proof. The proof is easy and therefore omitted. □ 

Lemma 4.7 (Top is decreasing). Lets,t e T{{!F U Q)^) be such that 'Pos{'s) = 
Vo5{t) and, for every u G 7-'os(s), m{s/u) < m{t/u). 
Then 7'os(Top(s)) D 7'os(Top(t)). 
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Definition 4.8. We consider the following ground rewriting system S over 
T{{J-'U Q)-^) consisting of all the rules of the form: 



where I r is a rule of TZ 



It — > rr 
m(7) = 



(20) 
(21) 



and T : V ^ '^{{^ U Q)^'") o, marked substitution such that, Vx G Var(Z) 

xf = xtQ M{1, x), dpt{xf) < fc • d. (22) 

Lemma 4.9 (lifting S\JAto U). 

Let s,s',t G T((jF U Q)~^) such that s' is m-increasing. If s' s and 

s ^5u^ ^ then, there exists a term t' € T((.FU Q)-^) such that 
s' fco— >7j t' and t' -^*^ i. 




* 




so 


t 


n 




A \ * A 


: * 


1 


1 


-1 * 




s ^ 


t 


SyjA 





Lifting <S U .4 Projecting TZ 

Figure 4: Lemma 4.9 and 4.16 

Proof. 1- Let us prove that the lemma holds for s -^suA t- Let us suppose 

that s' — >^ s -^A t- Let us then choose t := s'. It satisfies: s' i^o^^ t' and 
t' = s' s -^A t. Hence the conclusion of the lemma holds. 
Suppose now that s — >5 t. This means that 

s^C^t], t = C[rf] 

for some rule I ^ r E TZ, marked context C, and marked substitution t, satis- 
fying (21-22)^ 

Since s' — >^ s it must have the form 

s' = C[It'] 

where, for every x G Var(Z), xt' x^. Let us set 



XT := XT 



GM{l,x), t' := C[rT 
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Since s' is m-increasing, M{l,x) = M{C[l], x). Hence, by definition of o^, 
s' o~^n t' and by condition (21) this step is wbu, i.e. 

By Lemma 3.5, for every x, 

xf' = xt' M (7, x) XT M (I, x) = xf. 
Hence t = C[rr ] C[7'f ] ^ t. 

2- Let us prove, by induction over the integer n > 0, the statement 
Vn e N,Vs,7,t e TiiTUQ)-'') 

(7 m- increasing & s' s & s -^Su^ ^) ^ (s' fco^^ ¥ k¥ -^*^t). (23) 

(here i' is imphcitly quantified over T{{!F U Q)-'^))- 
Basis: n = 0. 

In this case s' = s. Choosing t' := Z, the conclusion of imphcation (23) holds. 
Induction step: n > 1. 

Let us suppose that the hypothesis of implication (23) holds. There exists a 
term i eT^TU Q)-^) such that 

By induction hypothesis, there exists some t' G T((jru Q)-^) such that 

s' feo^^ t' & i' i. (24) 

By lemma 3.14 t' is m-increasing and by point 1 of this proof, there exists some 
Fe r((J^UQ)^'=) such that 

i' kO^*T^T ^\t. (25) 

Putting together statements (24) and (25), we obtain the conclusion of implica- 
tion (23). □ 

Remark 4.10. The assumption that A is standard (19) is not used in the above 
proof. Hence Lemma 4-9 also holds without this restriction. 

Lemma 4.11 (projecting one step of 7^ on 5 U .4). 

Lets,teT{{TUQ)-'') such that: 

1- S o^Ti ^' _ 

2- The marked rule {l,r) used in the above rewriting- step is such that m{l) = 0. 

3- s is m-increasing. 

Then, Top(s) ^^^s Top(I). 
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Proof. Let us assume hypotheses (1,2,3) of Lemma 4.32. In particular: 

s^C[la], t^C[rW\ 

for some C,a,l,r,'a fulfiUmg (7-9) and m(7) = 0. Let us then define a context 
D and marked substitutions r, t by: 

:D[]=Top(CD). (26) 

Vx e V, XT = Top(xct), XT ~ Red{xa,'Pos{xT)). (27) 

We claim that 

Top(s) -^*^ D[It] -^s D[rW] = Top(t). (28) 
We cut into four facts the detailed verification of this claim. 
Fact 4.12. 7'os(lTop(^)) C 7'os(Top(I^)). 

7 xa 



mi m 




case 2 

Figure 5: Fact 4.12 

Let u e 7'os(ITop(ct)). 
Case 1: u E Vos^(l). 

In this case |m| < d. Hence, for every factorization u = ui-U2, since m(7/ui) < k, 
\u2\ < |m| < d < (fc + 1 - m{t/ui))d. 

Case 2: 

U = V ■ w 
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for some X 6 Var(Z),z; = pos(I, x) , w £ Topd(a;CT). Let us consider any decompo- 
sitfon u = ui ■ U2 and show it fulfils condition (2) of Definition 4.3. 
Wc use the notation 

rai = m{la / ui) ^ m — m{la/f) 

where / is the father of v. If mi = this condition (2) is clearly true. Let us 
assume that mi > 1. 
Case 2.1: ui ^ v. 

In this ease there exists u'l such that 

V — uiu[, U2 ~ u[w, \u[\ > 0. 

As w g Topd(xCT), 

\w\<{k + l~ m(.T^))d (29) 
but m{xa) > M(1,x) = m + 1, hence 

|w| < (fc + l-m-l)d. (30) 

Using the fact that |mi| < dptil) < d we obtain that 

\u[w\ < (fc + 1 - m - l)d + d = (A; + 1 - m)d (31) 

and, since the marks increase from top to leaves, m > mi, so that 

\u[w\ < (fc + 1 - mi)d (32) 

which can be reformulated as 

|m2| < (fc + 1 - m(I^/Mi))d. (33) 

Case 2.2: v -< ui. 

In this case there exists u[ such that 

ui = vu'i, u'iU2 — w, ju'il > 1. 

As It) G Topd(.w) 

|u2| < (fc + 1 - m(a;^/Mi))d (34) 

which can be rewritten 

|u2| < (fc + 1 - m(I^/Mi))d. (35) 

Since in all cases condition (2) of Definition 4.3 is fulfilled. Fact 4.12 is estab- 
lished. 

Fact 4.13. Top(Ict) -^^'It. 
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We know that 

la -^*^ Top(Ict) (36) 

(by definition of Top) and tliat 

la -^*^ It (37) 

because, by (27), every xt is a reduct of the corresponding xa. Moreover, by 
Fact 4.12, 

Pos(7t) = 7'os(lTop(^)) C 7'os(Top(Ict)), 
and by Lemma 4.7 'Pos{Top{la j) C 7'os(Top(Ia)), so that 

Vos(It) C Pos(Top(7ct)). (38) 

Lemma 2.7 applied to (36-38) shows that Top(7ct) It. 

Fact 4.14. D[It] -^sD[rf]. 

By hypothesis (2) of the lemma, m(Z) = 0. 

By the general assumption (18) and hypothesis (3) of the lemma, 
Va; G Var(0, M{l,x) = M(C[l],x), 

hence 

xf := XT G M(I, x). 

Moreover, dpt{xT) < (fc + 1 - M{l,x)) ■ d < fc ■ d, since M{l,x) > 1. Hence 
It —> rr is a rule of S. 

Fact 4.15. D[rf] = Top(t). 

This fact follows from Lemma 4.5 and Lemma 4.6. 
Using these facts we obtain that 

Top(s) = D[Top(7ct)] ( by Lemma 3.12 and Lemma 4.5) 

:D[Top(7ct)] -^*^ D[It] ( by Fact 4.13) 

D[It] -^s D[rf\ ( by Fact 4.14) 

D[rf\ = Top(Z) ( by Fact 4.15). 

Thus claim (28) is verified, which proves the lemma. □ 
Lemma 4.16 (projecting 7?. on 5 U A). 

Let s, i e T{J--^) and assume that s is m-increasing. If J ^.o— >^ t then, there 
exist terms s' ,t' G T{{TU Q)-^) such that 



t' and t — t' . 
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Proof. The marked derivation s o— >^ t is wbu, hence it can be decomposed into 
n successive steps where the hypothesis 2 of Lemma 4.32 is valid. Hypothesis 
3 of Lemma 4.32 will also hold, owing to our assumption and to Lemma 3.14. 
We can thus deduce, inductively, from the conclusion of Lemma 4.32, that 
Top(s) — >5u_4 Top(I). The choice s' := Top(s),t := Top(I) fulfills the conclusion 
of the lemma. □ 

Lemma 4.17. Let s G T{T). Then s T iff s -^*sua QT ■ 

Proof. 

{=>)'. Suppose s k^Tz t and t G T. Let us consider the corresponding marked 
derivation 

s fco^^ t (39) 

where s s. Derivation (39) is wbu and lies in 'T{T-^). Let us consider the 
terms s',Z given by Lemma 4.16: 



and t — >*- t' . Since t Qj^, by Lemma 2.7 



SLlA 



(40) 



t'^\Qf- (41) 



Combining (40) and (41) we obtain 



(<S=): Suppose s ^s^JJ^ £ Qj''. 

The hypotheses of Lemma 4.9 are met by s := s,s' := s and t = q> . By 
Lemma 4.9 there exists some t' 6 r((J-UQ)^'=) such that 

s feo-^^ t' e Qj''. 

These derivations are mapped (by removal of the marks) into: 

s fe^^ t' ^*j^q£ Qf, 

which shows that t' E T hence that s k^ji T. □ 

We can now prove Theorem 4.2. 

Proof By Lemma 4.17, ( k-^n)[T] = {~'*suA){Qj'']'^'^i^)- The rewriting sys- 
tems S and A being ground arc invcrse-recognizability preserving (Theorem 4.1). 
So {—^suA)iQj ] recognizable and thus ( fc-^7j)[T] is recognizable. □ 

Corollary 4.18. Every linear rewriting system of the class BU is inverse- 
recognizability preserving. 

Proof If n belongs to BU(/s), then (^^)[T] = ( fc^^)[r]. □ 
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Remark 4.19. In the above proof of corollary 4-18 we could use the ground 
rewriting system iJA over the signature T ( recall that is obtained from S 
by forgetting the marks): when TZ belongs to BU(fc), 

H^)m-H50u^)[Q/]nr(^). 

This also gives an effective way for computing a ft. a recognizing {^tz)[T]. 

Example. WithTZ of example 3.1 and A = {T, {(/a}, {^a}, {a — > qs}) recognizing 
T = {a}, wc obtain 

'S" ^ {a ^ q,} U {ffe) 9{q.),g{Kq,)) -> \{q,), \{q,) a} 

Example. The derivation f (h(f (h(a)))) o— >* a given in Example 3.6 may be 
simulated by : 

f(h(f(h(a)))) f(h(f(hfe)))) -so f(h(g(hfe)))) -50 

^{m<l.))) -^SO f(h(ga)) ->50 g(h(<?a)) -5" ife) - 3 

4.2 General construction 

We show here that Theorem 4.2 still holds when the restrictions (18-19) are 
removed. 

4.2.1 Allowing variable Ihs 

Let TZ be some left-linear finite rewriting system over the signature J-. Let us 
introduce a new unary symbol f\^T and consider the signature T\ := .FU{/i}. 
We then define 

— r e 7^, Z ^ V} U {C\l\ — C\r\ \ l^r(^n,C context , dpt{C) = 1}. 

It is clear that TZi is a left-linear finite rewriting system over the signature J-i 
and every rule {li, ri) £ TZi is such that li ^ V. 

Lemma 4.20 {TZ embeddable in TZi). For every s,t <E T{!F) and integer k > 0, 

1- s^*^t^f,is) ^lk^ hit) 

2- s k^\t^ fi{s) fe^^^ 

In particular: s k-^*^ T ^ fi{s) u^*^^ fi{T) and TZ is BU(fc) iff is BU(fc). 
Hence Theorem 4.2 and Corollary 4.18 still hold, without assuming (18). 

4.2.2 Allowing non-deterministic automata 

Let TZ be some left-linear finite rewriting system over the signature J- fulfilling 
restriction (18) and let A = [T ., Q, Qf, F) be some f.t.a recognizing a language 
T (this f.t.a is not assumed standard, nor merely deterministic). 
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Automaton A Let us define A := {T, J-q U Q, Tqj U Q/, f ) by: 



Q ■■= V{Q) 

Ta,f := {a e ^0 I a e L{A)} 

Qf := {PeQ\PnQf^9} 
f := {/(Pi, . . . , P,„) ^ P I ?7i > 1, / e T,n,Pi, ...,P,neTaUQ,P = Qn [f{Pu . . . , P™)]-.^} 

Some precisions about our notation: 

- in the last definition the P, which are equal to an element a, S are identified 
with the singleton {ai\ in the notation [/(Pi, . . . , Pm)]^;;^. 

- with this convention, /(Pi,...,Pm) denotes the set {f{pi, . . . ,pm) \ Pi € 

Pi, • ■ • ,Pm € Pm}. 

- [/(Pi, ■ ■ • , Pm)]~^'\ denotes the set of descendants of /(Pi, . . . , Pm), as defined 
in equality (5). 

Note that this construction of A from A is just a slight variant of the usual 
powerset-construction. We still denote by S the system deduced from TZ and 
A along Definition 4.8; we denote by S the system deduced from TZ and A 
along Definition 4.8. We shall show that the structures (T(jru Q), ^su^l) and 
{T[T U Q), — '■ju^) ^'^^ very close one to each other. A precise formulation will 
be given in terms of simulation (sec Definition 2.1). 

Lemma 4.21. For every f.t.a A, the f.t.a A is standard. 

This lemma follows immediately from the above definition. 

Simulations for A and A We define a binary relation «C [!Fq U Q U V) x 
{ToUQU V) by: 

«:= {(a, a) I a e To}U{{v, v) \ v e V}U{{p, P) \ p e P, P e Q}U{{q, a) \ q e Q,a e J^o,a q}. 

We extend « into the binary relation -C T{{T U Qf,V) x T((J^ U Qf,V) 
defined as follows 

Definition 4.22. For every t, t G T{{J^ U Q)^, V) x r((J^ U Q)^, V), t - t if 
and only if 

1- 'Pos(f) = 'Pos(t) 

2- Vu G Xn(t),f(u) = i{u) 

3- Vu G £v(4 m(t/M) = m(t7M) & ¥^{u) « {"(m). 
Lemma 4.23. 

is a simulation of {T{{TUQf,V),^s) by {{T{T U QY\V),-^^). 
2- IS a simulation of {T{{TUQf,V),'^g) by {T{{T U Qf ,V), ^s)- 



Proof. 

Point 1 Let us suppose that s, t G T((J^UQ)^, V), 7 G (T(J^UQ)^, V) are such 
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Figure 6: Lemma 4.23 



that s s' and s -^s t- 
Thus 



for some context C[], rule I ^ r G TZ and substitutions t, r fulfiUing (22). By 
Definition 4.22 the term s' has the form 



s' = C'[1't'] 



with 



(42) 

C ~ C^, (43) 

Vx- e Var(/),a;f ~ XT'. (44) 

Every label of a leaf of 7 belongs to U V. Relation (42) thus imphes that 
I ~ I', hence that 

r ^ r e 7^. (45) 

Let us define 

F := C'[rP] 

where the substitution r' is defined on every x £ Var(Z) by 

xP := xP Q M(F,x). 

Relation (44) implies that, for every x E Var{l), {xt)qM(1, x) ~ {xt')QM{1' , x) 

i.e. 

XT ~ xt' . (46) 

Statement (45) shows that s' t' while relations (43) and (46) show that 
t ^ t'. Point 1 of the lemma is thus proved . 
Point 2 

Let us suppose th_at t e T{{T U Q)^,V),7',¥ G {T{T U Q)^ ,V) are such that 
t ^ t' and s' — >^ t'. 

We know that 
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for some context C'[], rule I r £ TL and substitutions r',r' fulfilling (22). 
Since t ~ t' we must have 

t = C[rf] 

for some C[] C'[] and some substitution t fulfilling 

Vx G Var(r), a:? XT'. (47) 

Let us define 

s := CH (48) 
where the substitution is built in the following way: 

Vx G Var(Z),7'os(.Tcr) := Vosixr) 

the marks are defined by 

Va: e Var(;), Vu e 'Pos{xa), m{xU/u) := m(2;T'/u) (49) 

and the labels of the unmarked underlying substitution are defined by 

\/ X £V'ar{r),\/u £ Vos{xf), xa{u) := xt{u) 
Vx G Var(Z) \ Var(r), Vw e Xn(xo'), x(t(u) := xt'{u) 
\fx eVar{l)\Var{r),\fu e Cy{xa),xa{u) « xt'{u) 

(this last choice can be made because, for every symbol a' G JFq U Q there exists 
some a G J^o U Q, such that a « a'). 

From (48), the construction of a and (47) follows the property that 

s ~ s' . 

Since the marks of a" are taken from those of r' (see (49)), using the hypothesis 
that s' — s-g t' , we obtain that 

s —^s t. 

□ 

Lemma 4.24. 

1- ^ IS a simulation of {T {{T Qf\V) , ^ a) by iT{{T U Qf ,V), 

2- is a simulation of {T{{T U Qf, V),^j) by {T{{T U Qf, V), ^^). 

Proof. 
Point 1 

Let us suppose that s,i G T {{T \J Qf^ ,V) ,7 G (T(J^UQ)^,V) are such that 
s ^ s' and s t. 

Thus __ _ 

s = C[I], t = C[r], s' = C'[;'] 
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Figure 7: Lemma 4.24 



for some contexts C[], C"[] and rule l^r eT fulfilling C[] ^C'[],l^l'. 
case 1.1: I ^ r = a ^ q for some a G J^Qj q ^ Q- 
We define 

F Z (50) 

Since a q , the relation g « a holds, hence t ^ t' . It is clear that s' t'. 
case 1.2: I ^ r = f{pi, . . . ,p,„) ^ p for some to > 1, / e J>„,pi, . . . ,Pm,p £ 
Q. 

We thus have 

~l =7 {Pi, r=p, I' = 7{Pi,...,Pm) 

for some Pi G (Jb U Q)^ such that « P;. Let us define P' G Q'*' by 

P' :^Qn[fiPi,...,Prn)W^, m(P^):=m(7) (51) 

and finally 

¥ ■.^C'[P']. 

Sinc_e C P^ (if P, G Q) or_Pi (if P, G ^q), P G P'. It follows that 

t ~ f'. The definition (51) of P' also implies that /(Pi, . . . , Pm) — >^ P', hence 
that 7 F. 

Point 2 

Let us sup£Ose that t G T((J^ U Q)'^, V),7,F G (TiTUQf^V) are such that 
f ~ and s' f'. 
We know that 

t = C[p], 7 = C^[F], F=C^p] 

for some contexts C^[], symbol p e (TUQfuV and rule V ^ r' e f. Such 
a rule has the form 

l' = f{Pi...,P.^)^P'=r' ^ ^ (52) 

where P^' G J^o U Q, P' G Q. Since t ~ F, we must have C C' and p G P'. 
Since p G P' there exist pi, . . . ,Pm G JFq U Q such that, /(pi, . . . ,Pm) P 
and, for every i G [1, to], either pi = P/ G J^o or (p.; G Q, P/ G (5,Pi G P/). Let 
us define s by 

s := C[f{pi, . . . ,Pm)], Vu G Pos(s), m(s/u) = m{s'/u). 
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Since pi « P/ we get that s ^ s' and since f{pi,... ,Prn) — P we get that 

As a straightforward consequence of Lemma 4.23 and Lemma 4.24 wc get 
the foUowing lemma. 

Lemma 4.25. 

1- - IS a simulation of {r{{T \jQf\V),^sijA) by {T \J Qfy) , ^l^^) . 

2- IS a simulation of {T{{T {J Q)^,V),^s^^) by (r((J'U Q)f^, V), ^su^). 

Let us show that Theorem 4.2 still holds without assuming (18-19). By 
§ 4.2.1 wc arc reduced to treat the case of a system TZ fulfilling (18). Since A 
is standard, Lemma 4.17 applies on TZ and Using then Lemma 4.25 we 

get that for every term s G T{J-) 

S k-^-R J S ^ f 

and we can conclude, as before, that ( k^Ti)\^\ rational. 
4.3 Complexity 

The proofs that we gave for Theorem 4.2 are constructive i.e. give an algorithm 
for computing a non-deterministic f.t.a recognizing ( k^Tz)\T] from a non- 
deterministic f.t.a recognizing T and a system TZ which belongs to the subclass 
BU(fc) We sketch here some estimation of the complexity of this algorithm: in 
§4.3.1 we treat in details the case of semi-Thue systems belonging to BU^(l) 
and, later on, the case of term rewriting systems in BU~(1); in §4.3.2 we sketch 
an analysis of the more general case of systems in BU~(fc), for any natural inte- 
ger k. In §4.3.3 we prove a NP-hardness lower-bound showing that some of our 
upper-bounds cannot ( presumably) be significantly improved. 

4.3.1 Upper-bounds for systems in BU (1) 

Let us treat here the case where TZ belongs to the subclass BU^(l). 

Semi-Thue systems 

Theorem 4.26. Let T be a signature with symbols of arity < 1, let A be 
some f.t.a recognizing a language T C T{J-) and let TZ be a finite rewriting 
system in BU~(1). One can compute a f.t.a B recognizing (^7j)[r] in time 
0{\T\-\og{\:F\f-\\Af-\\TZf)- 

Our proof consists in reducing the above problem, via the computation of 
the ground system S of Section 4.1, to the computation of a set of descendants 
modulo some set of cancellation rules, which is achieved in cubic time in [2]. 
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Proof. 

Step 1: We consider here the case where X is a fixed alphabet without arities, 
T is a recognizable subset of X* and 7^ is a semi-Thue system over X . 
We consider the right-linear one-step derivation relation generated by TZ: for 
every m, u S X* , u h—^tz v iff there exist w G X*, / — > r G 72, such that 

u = w ■ I, V = w ■ r. 

The binary relation is, as usual, the reflexive and transitive closure of 

ri—^TZ- Let us construct the symmetric alphabet associated with X by adding a 
twin-letter x' for every letter x £ X: 

X' := {x \x(z X}, X ■.= X\J X'. 

The map x ^ x' \s extended to X* by 

{x±X2 ' ■ ■ Xi • • ■ Xn) '— X^ • X.^ ■ X2 ' X-^. 

We then define a rational set TZ and a semi-Thue system V by: 
n {I'r \ l^re ny, V ■= {{xx'.e) \x<^X}. 
It is proved in [3] that: for every u, v G X* 

Hence 

[TK n^n) = [Tn*K^*v) (53) 
A non-deterministic f.a. recognizing T • TZ* can be computed in time 0(||A|| -I- 

\m)- 

By the main result of [2], for every recognizable set R , a non-deterministic 
f.a. recognizing can be computed in time 0(71"^) where n is the size 

of an automaton recognizing R. Hence a non-deterministic f.a. recognizing 
[^7^*](^^) can be computed in time 0((||A|| + \\n\\))^. 

Step 2: We consider here the case where X is an alphabet without arities 
(which is not fixed anymore), T is a recognizable subset of and 7?. is a semi- 
Thue system over T. 

Suppose that X := {xi, . . . , Xn} and n ~ 2^. Let (p : X* — + {a, b} be some sufRx 
encoding. For example we can define ip{xi) as the i-th word in {a, b}^ for some 
total ordering over {a, b}^. One can check that 

[T]{ ri^n) - V^HMm ri-^Un)))- (54) 

A f.a. A' recognizing ip{T) can be computed from A in time 0(||A|| • p) i.e. 
0(||A|| • log(|X|)). Using the result of step 1, a non-deterministic f.a. A" recog- 
nizing [(y5(r)]( d^l^cji)) can be computed in time 
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0((log(|X|)(P|| + ||7e||))^). A f.a. B can be obtained from A" by the classical 
construction for the operation (p^^. This gives a complexity: 

oi\x\-M\x\)i\\A\\ + \\n\m. 

Step 3: We consider here the case where T is a signature with arities in {0, 1}, 
T is a recognizable subset of T{T) and {TZ^J-) is a rewriting system in BU~(1). 
Let S -.^ {It ^ TT \ I ^ r e n,T : V ^ Q}. Since 7^ is BU~(1), by a small 
variation of Lemma 4.17, for every s G T{J-), s — >^ T -i^ s -^g^^ Qf- Thus 

i^n)m = [Q/](^(5u^)-0 nr(,F). (55) 

Let us denote by C the ground rewriting system {S U A)^^ . Let us notice that 
T(jr) is a subset of J-* . Moreover T{!F) is saturated by h—^c ^'^'^ ^^'^ relation 
— >J restricted to T{!F) coincides with restricted to T{!F). We can thus 

apply the results of step 2: as ||5 U ^|| < (||72.|| • ||^||), a f.a. recognizing 
[Qf]i^c I-^]) '^^^ computed in time 
0(1^1 • (log(|.F|)(|!7e|| • \\A\\ + \\n\\)f) hence in time 

0(1^1 • (log(|^|))3 . ||7^||■V p||3). 

□ 

Let us recall that every left-basic semi-Thue system can be viewed as a 
BU~(1) term rewriting system (Lemma 3.22). This Theorem 4.26 thus extends 
[2] (where a cubic complexity is proved for cancellation systems over a fixed 
alphabet) and improves [1] (where a degree 4 complexity is proved for basic 
semi-Thue systems). 



Term rewriting systems Let us turn now to term rewriting systems over 
arbitrary signatures. The following refinement of Theorem 4.1 has been proved 
in [10] 

Theorem 4.27. Let T be a finite set of terms and S be a ground term rewriting 
system. A f.t.a recognizing the set [T] can be computed in time which is a 
polynomial function o/Card(r) + ||5||. 

Given a system TZ we define 

A(7^) max{Card(Posv(0) \ l^r eU}. 

We extend the above complexity result into the following 

Theorem 4.28. Let {TZ,J-) be a finite rewriting system in BU^(l) and let A 
be some f.t.a over T . One can compute a f.t.a B recognizing (^^)[T'] in time 
polynomial w.r.t. \\TZ\\ ■ p||M^>'{A(TC)4}. 

Our proof consists in computing the ground system S of Section 4.1 and to 
apply Theorem 4.27. 
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Proof. Let us consider the system 



S := {It ^ rr \ I ^ r e n,T : V ^ Q}. 

One can check that 

< ||7^|| • IQ^^l 

By the same arguments as in the case of arities not bigger than 1, we still get 
that 

H^)m = [g/]Ha)nT(^) (56) 

where C :— {S U A)^^ . It is clear that 

||c|| < \\n\\ ■ igi'^*^) + \\A\\ < ||7^|| • \\Af^'^\ 

By Theorem 4.27, a f.t.a A' recognizing computed in P-time 

w.r.t. \Qf\ + \\C\\, thus in P-time w.r.t j|7e|| ■ ||^||max{ACR),i}. Let T' C T he the 
subset of symbols that have at least one occurrence either in the transitions of 
A or in the rules of TZ. By a direct product with the obvious f.t.a. recognizing 
T(jr') we can compute a f.t.a. B recognizing [Qf]{-^c) ^ '^{^')- The overall 
computation of B takes a P- time w.r.t \P\ ■ ||7^|| • ||^||max{A(TC)a}^ < 
||7?.|| + ll^ll- Hence the computation takes a P-timc w.r.t 

\\n\\ ■ pj|M-{A(7j),i}_ 

By (56), this automaton B recognizes (— '-' 
4.3.2 Upper-bounds for systems in BU~(fc) 

The upper-bounds resulting from the use of the precise system S from Definition 
4.8 would be unnecessarily high. Therefore we start this subsection by defining a 
smaller ground system Si C S. Subsequently we sketch a proof that the refined 
system Si can also simulate the original system TZ. Finally, we derive from this 
improved construction an upper-bound on the complexity of constructing a f.t.a 
for the set of ancestors of a regular set of terms. 

Let us define the set of subterms of the Ihs of TZ by 

SLHS(7e) {t e T{T, V) | 3^ ^ r e 7^, 3C G Ci{T),i ee„ C[t\]. 

In words: SLHS(7?.) consists of all subterms of Icfthand-sidcs of rules of TZ, up 
to renamings of the variables. 

Definition 4.29. Let TZ be some TRS over the signature J-, which has no empty 
lefthand-side (this is assumption (18)). We consider the ground rewriting sys- 
tem Si over T{{J- U Q)-'') consisting of all the rules of the form: 

It rf (20) 

where I —>■ r is a rule of TZ 

m(l) = (21) 
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and r : V — > T{{T U Q)- ) is a marked substitution such that, Vx G Var(Z) 



and, there exists substitutions r, 
Q such that 

T = Tl O T2 • • • O Ti O • • • O Tfe. 

Lemma 4.30. Si 'ZS 



= xfQ Mil, x) (57) 
V -> SLHS(7e) Jorl<i<k-l andTk-.V ^ 

(58) 



Proof. Conditions (20,21) arc those imposed on the rules ol S in Definition 4.8. 
For every x € V, Tiix) has a depth smaller or equal to d implying that the 
additional condition (22) of Definition 4.8 also holds. □ 

Lemma 4.31 (lifting Si\JAio U). 

Let s, s',t G T((jF U Q)-'^) such that s' is m -increasing. If s' s and 

s — I I A t then, there exists a term t' G T{{J- U Q)-'') such that 



t' and t' 



t. 



Proof. By Lemma 4.30, iSi C S. We assume that TZ has no empty l.h.s. so 
that, by remark 4.10, Lemma 4.9 holds. These two lemmas imply the above 
lemma. □ 

Lemma 4.32 (projecting TZ on Si U A). 
Let's,!, G T{J--^),q G Q-^ . If 's kO^Tz ^ ^-^ bu~(fc) derivation, s is m- 
increasing and t q then, there exists a term s' G T{{J- U Q)-^) such that 

s s' -^*s^uA 9- 
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Figure 8: Lemma 4.32 

In order to prove this lemma, by induction over the length of derivation 
s k° —^Tz t, we introduce a notion of decomposition of a term relative to a 
derivation (modulo TlU A) that starts from this term. Each component of the 
decomposition is called a cascade and the full decomposition is called a bunch of 
cascades. We shall examine, in the sequel, marked derivations of the following 
form: 

D : to o^-ji^*^ ■ ■ ■ o^Tj^^ ti o^Ti^*^ ■ ■ ■ o^T^^^ ti (59) 
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where the i-th step 

U o^TZ^*A U+1 (60) 

starts with an apphcation of a rule li — > 7'j G TZ. For every i G [0,^], wc denote 
by Di the subderivation 

Di : ti o^n^*A ■ ■ ■ 

Definition 4.33 (Cascade). Let D be some derivation (modulo TZUA) of the 
form (59). We define inductively, for pairs {h,i) e N x [0,£], the notion of 
cascade of level h w.r.t. A- Let S gT{Itu Q)^) andheN 
CI- If S is a subterm ofU and S € (^o U Q)^ , then S is a cascade of level h 
w.r.t. Di 

C2- If S is a subterm ofti, which has the form S ^ s ■ a (for some term s ^ V 
and .substitution a ) and EIA > such that, 

2.1 s has a residue in t^+A which is a subterm of the occurrence of li^\ ( used 
in the [i + \)-th step of D ) 

2.2 for every variable v £ Var(s), every residue ofv-a inside ti^\^i is a cascade 
of level h w.r.t. Di-^-x+i, 

then S is a cascade of level h + 1 w.r.t. Di. 

We call source of the cascade S the given occurrence of the factor s inti. 
(See figures 9-11). Note that: 



So 




Figure 9: A cascade of level 3: first step 

- every cascade of level /i w.r.t. Di is also a cascade of level /i + 1, /i + 2, . . . 
w.r.t. Di] 

- by Definition 4.33, if D is a marked derivation of length one, 

D : C[la] ^ C[r^, 
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Figure 10: A cascade of level 3: second step 



the given subterm la is a cascade of level 1 w.r.t. D iff cr maps every variable 
V e Var(0 n Var(r) into {To U Qf. 

Lemma 4.34 (Subcascadc). If S is a cascade of level h w.r.t. a derivation D 
and S' is a subterm of S , then S' is also a cascade of level h w.r.t. D. 

This can be proved easily by induction on h. When such a situation occurs, 
S' is called a subcascadc of S w.r.t. D. 

Wc define now a notion which generalizes the notion of a transversal with only 
null marks. This technical notion will be useful for analyzing our decompositions 
of terms in cascades. 

Definition 4.35 (Quasi-nuU transversal). Let t e T{{F\J Qf^), P = Pos(t) 
and let U = {uq, ui, . . . , be a transversal oft. 
The transversal U is said quasi-null iff 

Wv e P,Vi e [0,m],3j G [0,to], V ^ ^ {v ^ Uj & rrt{t/uj) = 0). 

In words: every node strictly above U is also strictly above some node Uj 6 U 
with a null mark. 

Definition 4.36 (Bunch of cascades). Let D,£ be given, as in definition 4-33. 
A term t G T((jru Q)^) is called a bunch of cascades w.r.t. derivation D iff, 
t = tQ and 'Pos(t) possesses a transversal U ~ (uq, ui, . . . , u„i) such that 
(BCl) every subterm t/ui is a cascade w.r.t. D 
(BC2) U is a quasi-null transversal 

(BC3) the occurrence of lo which is used in the first step of D is the source of 
one of the cascades t/ui. 
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Figure 11: A cascade of level 3: third step 



Figure 12 represents a bunch of cascades where m = 8, the cascades at nodes 
uq, ui, Us, M5, Mg, Us have level 0, at nodes U2, U4 level 3, at node uj level 2. 

Lemma 4.37 (projecting one step of 7?. on a cascade). 
Let 

Di : t'l o^Ti^\ t'2 o^n^*A ■ • • o^tc^^ • • • o^k-^a 

and let to, G T{J--^) be m-increasing terms such that 

tQ k°^n ti ti. 

If t'l is a bunch of cascades w.r.t. Di, then, there exists a termt'o G T((jFUQ)-'^) 
such that 

to -^A *o k^—^n-^A ^1, 
and tg is a bunch of cascades w.r.t. the derivation Dq obtained from Di by 
extension on the left by the step Iq o~^tz~^a ^i- 

Diagram: 

^0 k°^n ti 

I * I * 

-Do : tQ k°^n^A o^n^A ' ' ' °^'JI^a 

Sketch of proof. 

Let ^0 ^ 7*0 be the rule used in the one-step derivation to o^-ji ti. Let U = 
(uo, . . . ,Ui, . . . , u„i) be a transversal furnished by Definition 4.36 applied on ti. 
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Figure 12: A bunch of cascades 



Let us construct a term t" and a term tg such that fulfils the properties 
announced by the lemma. We define 

P := CL(7'os(<T') U yo ■ Pos(ro), Vos{h)) (61) 

We then define : P ^ U Q^^ by, for every u e P: 

if M e In{P), t'liu) := ti(u); if u £ C\/{P), t" (u) := pi{u); (62) 

where pi is the partial run of the automaton A associated with the computation 
ti — >^ Zj^. Let be the unmarked term obtained from t'( by applying the rule 
~* ''0 "backwards" i.e. there exists a substitution t such that: 

t'o = C[loT]y„, t'{ = C[roT]y^ 

and finally, let t'^ be the marked term obtained from the domain and labels of 
(on one hand) and the marks of to (on the other hand). One can check that: 

^0 -^*A % k°^n t'l -^A i'l- (63) 

We distinguish two cases, according to the relative position of the root yo of the 
given occurrence of ro and of the transversal U . 
Case 1: yo is above at least one Ui 

Le us suppose that yo "i, yo ^ • ■ • J/o ^ Ui^p and Vj £ [0, i — 1] U [i + 

p+ l,m],?/o ;^ Uj (sec Figure 13). Let 

Uo := (tto, . . . ,Ui-i,yo,Ui+p+i, . . . ,Um). 
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*o t'{ A 

Figure 13: Lemma 4.37, Case 1 

Uq is a transversal of t'^. Condition (BCl) is clearly fulfilled by the Uj, for 
j G [0, i — 1] U [i + p + 1, m]. Since the transversal U of term t'l was fulfilling 
condition (BC2) and i'^ is m-increasing, no leaf of the occurrence of tq can be 
strictly above U . Hence t'l/yo consists of an occurrence of followed by some 
subcascades of the cascades ti/uj (for j G + p]) and, possibly, some new 
cascades of level 0. Thus, the subterm of at position consists of an occur- 
rence of a marked version of Iq followed by some subcascades of the cascades 
ti/uj (for J G + p]) and, possibly, some new cascades of level 0: this is a 
cascade for Dq. We have thus checked condition (BCl). 

Let V £ 'Pos(Zq). If w -< Uj , for some j e [0, z — 1] U + p + 1, m], since U was 
quasi-null, 3j' e [0,to], v -< Uj> & m(ti/uj') = 0. 

- If j' S [0, t — 1] U -l-p -I- 1, m] the mark of Uj> in tg is the same as in t^, hence 
Mt'o/uj') = 

- If j' G + p], ya,v must be comparable, but yo ^ v would imply i/q ^ Vj, 
which is false; hence v -< yo & m(?Q/j/o) = 0. 

Ifv ~< yo, then v ^ yo &^ m(to/yo) 0. 

We have checked that Uo is quasi-null (i.e. (BC2)). 

The occurrence of lo which is used in the first step of D is the source of the 
cascade at position yo, hence (BC3) holds. 
Case 2: yo is strictly below one Ui 

Let (zo, . . . , Zp, {ui)~'^yo, 2p+i, ■ • ■ , -Zp+p') be the smallest transversal of Zp/ui ex- 
tending the antichain (j/o) (see Lemma 2.2). Let 

Uo '■— {uo, • . • , UiZi, . . . , M.^Zp, J/o, UiZp^i, . . . , UiZp_|_p/, Mi+i, . . . , Urn)- 

Uo is a transversal of tp. Every io/wj for j G [0, i — 1] U [i -I- 1, m], is a cascade 
(because it was a cascade of t'j^ w.r.t. Di). The subterm to/yo is a new cascade, 
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consisting of a marked version of Iq followed by some subcascades oit'i/ui and, 
possibly, some new cascades of level 0. Every subterm t'^/uiZx , for A G [0,p+p'] 
has, as residue in ti, the subterm ti/uiZ\, which is subcascade of the cascade 
ti/ui w.r.t. Di. Wc have thus checked condition (BCl). 
Let V e Vosit'g). 

If u Uj , for some j G [0, i — 1] U [i + 1, m], since U was quasi-null, 3j' G [0, m], 

V -< Uji & m(l'i/uji) ~ 0. 

- If j' ^ i the mark of Uji in is the same as in ti, hence m(iQ/Mj') = 

- If j' = I, then V <u.i<yok m(Zo/?/o) = 0. 

li V ^ UiZ\, for some A G [0,p + p'], either v ^ Ui and we can conclude as 
above, or v < uj for some j G [0, i — 1] U [i + 1, m] and we have already con- 
cluded, or V = Uiv' k, v' ^ z\. By Lemma 2.2, v' -< {ui)^^yo and we get that 

V ~<yo t m(?o/yo) = 0. 

We have checked that Uq is quasi-null (i.e. (BC2)). 

The occurrence of which is used in the first step of D is the source of the 

cascade at position yo, hence (BC3) holds. 

□ 

Definition 4.38. Let D be a derivation of the form (59). A term 

S G T((J^UQ)^) is called a cascade o/ exact level h w.r.t. D iff S is a cascade 

of level h w.r.t. D and S is not a cascade of level ft, — 1 w.r.t. D. 

Definition 4.39. Let D be a derivation of the form (59) where the system TZ 
has no empty lefthand-side. The marked derivation D is bu^(A:) iff, Vi G [0,£[, 
mmin(^i) = and mmax(Zi) < k. 

In words: D is bu^(fc) iff the steps of rewriting (modulo TZ) are meeting the 
conditions of the usual bu~(fc) condition (note that no condition is required on 
the steps of rewriting (modulo A)). 

Lemma 4.40. Let D be a derivation of the form (59). If S is a cascade of 
exact level h w.r.t. D and D is bu^(A:), then h < k. 

Sketch of proof. Let D be a derivation of the form (59). One can prove by 
induction over h the more general statement: 

if S = s-a (where s ^ Var) is a cascade of exact level h (with this decomposition) 
w.r.t. some derivation Di, and every internal node of s has a mark > r, then 
there exists A > 0, such that a mark > r + h — I occurs in the occurrence h+x 
of lefthand-side of rule used in the i + A-th step of D (60). 
Since D is bu~(fc), a mark > k cannot occur in h+x, hence no cascade 5* w.r.t. 
D can have an exact level > fc + 1 . □ 

Proof of lemma 4.32: 

Let s,t G T{T-^),q G Q-^,n G N, such that s fco— >k t is bu^(fc),s is m- 
increasing and t q. 

Using inductively Lemma 4.37, we obtain a term s' and derivations 
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such that every t- (for < i < £ - 1) is a bunch of cascades for Di. By Lemma 
4.40 every cascade w.r.t. Di has level < k. Hence the i-th step of k^^n in 
derivation _D is a step for the relation ^Si- It follows that I? is a derivation 
modulo (iSiU -^a)-^ 

Theorem 4.41. Let k >2, let {TZ,T) he a finite rewriting system in BU~(fc), 
with A{TV) > 1 and A be some f.t.a over T recognizing a language T . One can 
compute a f.t.a B recognizing {^^)[T] in time polynomial w.r.t. 

lj7e||fc-(A(7J))'=-i . 

Note that: 

- for systems with fc < 1, the complexity is analyzed in Theorem 4.41 

- for systems with k > 2 and A{TZ) — 0, i.e. ground systems, the complexity is 
covered by Theorem 4.27. 

Sketch of proof.; 

Step 1: Let us assume TZ has no empty lefthand-side (18). 
By Lemma 4.31 and Lemma 4.32 

i k~-n)m = {^s«uA)[Qf]^n^) 

The construction of B consists in computing the ground system (obtained 
by erasing the marks in the system Si introduced by Definition 4.29), to apply 
the construction of Theorem 4.27 and, finally, to perform a direct product with 
a f.t.a recognizing T{J-). 

The set of possible substitutions ri, . . . , r^, Tk-i in Definition 4.29 has cardinality 
less or equal to 

j|.^j|A(7Z)+A^(K) + ...+A'=-i(TC) 

The set of possible final substitution Tk has cardinality less or equal to 

Since the number of rules of TZ is less or equal than \\TZ\\ we get an upper-bound 
for the number of rules of Si : 

Card(5°) < ||7^||l+AW+A^(7^)+...+A''-l(7^) . (64) 

For every rule £1 ri G iS?, since ii = £ ■ ti- ■ - Ti - ■ -Tk-i • Tk and ri = 
r ■ Ti - ■ ■ Ti - ■ ■ Tk-i ■ Tk for some t ^ r E TZ, and for every variable v G V, 
\Ti{v)\ < \\Tl\\, we have 

l^i| + |ri| < \\n\\ ■ (1 + A{n) + A\n) + ■ ■ ■ + A''-\n)) 

< \\n\\ ■ k ■ A''-\TZ) (65) 



42 



Multiplying the upper-bound for the number of rules by the upper-bound for 
the size of each rule, we obtain 

< ||7^||4•'^■^'"('^) • Mr'(^). (66) 

(we assume that TZ ^ % hence that 2 < j|7?.j| in the above majorization: for 
72. = 0, anyway, the computation of B consists of taking B := A, which takes 
no time ). The construction of the system 5° is straightforward, thus takes 
a time polynomial in H^^H. The computation, from A and S^, of a f.t.a A' 
recognizing {-^*soyjj^[Q f] takes a time polynomial in ||^|| -I- \\Si\\ by Theorem 

4.27. By inequality (66), \\A\\ + l|>Si°|| < \\nf-^-^'~'^^^ ■ which is a 

polynomial in \\R\\^-'^'''^^'^'^ ■\\A\\^''^'^\ het P C be the subset of symbols that 
have at least one occurrence either in the transitions of A or in the rules of TZ. 
Finally, B is obtained from A' by performing the direct-product of A' with a f.t.a 
recognizing T{J-'). The overall computation of B thus takes a time polynomial in 
|J^|'.||7^f-A'-'(K).p||A''(TC)_ Since we assumed that fc> 2, A(7e) > 1, ||7e|| > 2, 
it is also a polynomial in 

,|^||fc.A'=-(K) . \\j^\fi.n)^ 

Step 2; Let 72 be a general TRS (which, possibly, possesses some empty Ihs). 
The transformation TZ i— > TZi defined in §4.2.1, is a polynomial reduction of the 
general case to the subcase treated in step 1. □ 

Note that, for every fixed parameters k >1 and A{TZ), the construction of the 
set of ancestors of a rational set for some TRS TZ in BU~(fc) can be achieved in 
polynomial time. In general, for a fixed A(7Z) and variable k, the dependency in 
k is double exponential. In the case of unary terms wc get only an exponential 
complexity. 

Corollary 4.42. Let k > 2, let J- be a signature with .symbols of arity < 1, let A 
be some f.t.a recognizing a language T C T{J-) and let TZ be a finite rewriting 
system in BU~(fc). One can compute a f.t.a B recognizing (— in time 
polynomial w.r.t. 

\\nt-\\A\\. 

Proof. If A(72.) = 1; just replace A(72) by the integer 1 in the conclusion of 
Theorem 4.41. If k{TZ) = 0, the rcsuh follows from Theorem 4.27. □ 

4.3.3 Lower-bound 

We show here that, there exists a fixed signature J- and two fixed recognizable 
sets Li,L2 over J- such that, the accessibility from Li to L2 for a rewriting 
system in BU~(1) is NP-hard. This shows that the upper-bound given by The- 
orem 4.41 in the case of a fixed parameter k, which is exponential w.r.t. ||72.||, 
cannot presumably be significantly improved. 
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Let us fix the signature T {/, 5, A, V, 0, 1} where the arities are 2, 2, 2, 2, 1, 0, 
(for the symbols in the given ordering). We shall also use the subsignature 
T' \^ T\ {/, g}. Let us consider the regular term-grammar over the signa- 
ture J-", with non-terminals Tq, Ti, T2, Go, Gi, G and with set of rules: 

Ti ^ 1 

^2 ^ /(T2,G) + 2 
Go ^ 5(Go,0) + 
Gi ^ .g(Gi,l) + l 
G — Go + Gi 

For sets of terms ii, L2 we abbreviate by L\ L2 the sentence 3ti G ii, 3f;2 G 

L2,ti — >^ ^2- 

Theorem 4.43. T/ie problem to decide, for a given linear term rewriting system 
{n,T) in BU^(l), whether L(e, Ti) L(^, Tz), is NP-hard. 

We reduce, in P-time, the problem 3-SAT to the above problem. Let (p be 
some propositional formula in 3-Conjunctive Normal Form: ip is a formula with 
n variables xi,X2, ■ . ■ ,Xn of the form 

n 3 

^ = A V <i (67) 

k=l l=\ 

where v-k,(. G {xi, 2:2, . . . , £k,t S {—1,-1-1} with the convention that v'^^ 
(resp. denotes v (resp. -^v). Let us define a kind of linearization of ip over 
a set of 3?!^ new variables Xij, for l<i<n, 1<J< 3n. 

n 3 

^■■=K\l^ti (68) 

fe=l £=1 

where V}^^i := x^j iff (w^,^ = xi and Card{(fc',f) G | (fc',^') </ex 

(fc,£) & Wfe'^f' = Wfc,f} = j). In words: -0^/ := Xi,j when the meta-variable 
i;fc,f denotes the variable Xi and it is exactly the j-tli occurrence (from left to 
right) of Xi in formula (67). Note that (p is linear and 

ip ~ p><j (69) 

for the substitution 

(T : Xi^j ^ Xi. (70) 

Let us denote by cc^^, the sequence of 3n variables (xi^i, . . . ,Xi,3„) and by a;*,* 
the sequence of 3ri^ variables (a;i,i, . . . , x^j, x^j+i, . . . , Xn,3n)- We define three 
sequences of terms (/„)„>i, (5n)n>i, (hn)n>i by the following recurrence rela- 
tions: 

fi{xi) := /„+i(xi,a;2, • ■ • ,a;„+i) := f{fn{xi,X2, ■ ■ . , x„), a:„+i), 
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gi{xi) ■■= Xi, 5„+i(xi,X2, . . . ,Xn+l) ■= g{gn{xi,X2, ■ ■ ■ ,Xn),Xn+l), 

/in (a;*,*) /ri+i(2, g3n(a;i,*), . . . ,g3nixi,*), ■ ■ ■ , 53n(a;„,»)). 

We define a fixed ground rewriting system VC consisting of the rules allowing 
to evaluate a Boolean formula, taken in reverse order: 

O^OAO, O^OAl, O^IAO, l^lAl, 
O^OVO, l^OVl, I^IVO, l^lVl, 
1^-0, 0-^-1. 

(The initials VC intend to make the reader think of "Propositional Logic" ) . We 
define the special rule associated with ip by: 

(^(a;*,*) -> /i„(a;*,*) (71) 

where ip is some term over J- expressing the Boolean formula ip (the n-ary meta- 
symbol /\ can be translated as a left-comb with internal nodes labelled by the 
binary symbol A and similarly for the ternary symbol Y). We finally define the 
system {T,TZ^p), associated with Lp, by: 

TZ^ := VC U {^{x^,^) /i„(a;*,*)}. 

We cut into several lemmas the proof that (p ^ TZ^p is a valid reduction. 

Lemma 4.44. For every 6i, 62, . . . b £ {0, 1}, b p{bi,b2, . . . , b„) iff 

b -^TZy, fn+l{'2,g3n{bl,bi, . . . , 61 ) , 53,1 (62 , ^2 , ■ • ■ ,^2), • ■ • , ff3ri (^n , &ri , • ■ • 

Proof. Using the special rule (71) we get: 

0''' ^TZ^ fn+1 (2,53„(6i,6i, . . ■ ,bi),g3n{b2,b2, ■ ■ . ,62), • ■ • ,53n(&«,^«, ■ • ■,bn)) 

(72) 

where the substitution r is defined by T{xi_j) := bi. We can factorizc r as 
T := a o 9 where a was defined in (70) and 9{xi) := bi. The one-step rewriting 
(72) thus can be seen as 

{(P<t)9 ->tc^ /„+i(2,53„(fei,5i, . . ■ ,bi),g3n{b2,b2, ■ ■ . ,62), . . ■,g3n{bn,bn, ■ ■ ■,bn)) 
which, by the identity (70) shows that 

(p{bi,b2, ■ ■ ■,b„) = ipd ->-R^ /„+i(2,.g3„(6i,5i, . . . , 61), g3„(&2, 62, • ■ • ,^2), • ■ ■,g3n{bn,bn, ■ ■ ■,bn)) 
The lemma follows easily from this last relation. □ 
Lemma 4.45. The following two conditions are equivalent: 

1- {i}^*^^L{g,T2) 

2- There exist 61, 62, ... 6„ G {0, 1}, such that: 

1 -^n^ /n+l(2,ff3«(6l,fol, . • . ,bl),g3n{b2,b2, ■ ■ . ,62), • . ■,g3n{bn,bn, ■ ■ .,&«))■ 
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Proof. 1-Suppose condition 1 holds: there exists t2 G L{Q,T2) such that 1 ^tj^ 
t2- Since no rule of VC~^ can be applied on t2, the derivation must decompose 
as 

where the last step uses the special rule (71). Note that t2 has exactly one 
occurrence of the constant 2 while t'2 has no occurrence of this symbol. Since 
1 — >^ t'2, the term must belong to T{J-'). This implies that the contractum 
in <2 was t2 itself: 

t2 ■— h„{x^,^)T 

for some some substitution r : {xij |l<i<n, l<j< 3n} T{!F'). From 
the fact that t2 G L(0,T2) we deduce that, V« G [l,n],3&i G {0, 1}, 

T(a;i,i) = T{xi^2) = . . . = T(2:i^3„) = hi. 

Hence ^2 = /n+l(2, g3n(&l, • ■ • , ^1), 53n(&2, ^2, ■ • ■ ,^2), ■ • ■,g3n{bn,bn, ■ ■ 

and point 2 holds true. 

2- Suppose condition 2 holds. Since the last term of this derivation belongs to 
L{g,T2), point 1 holds true. □ 

For every marked term t, we call a path P' C t a .F'-path iff all the labels 
of P' belong to T'. Let us consider the following property P{s) of a term 
sGT(^^): 

VP' C Pos(s), if (P' is a J^' - path and Min{m(s/u) | m G P'} = 0) 

then Max{m(s/M) | u G P'} = 0. (73) 

Every wbu rcwriting-step of o^k^ preserves P in the following sense 

Lemma 4.46. For every 's,t G T{J-'^) if (P(s) and's o^-ji^ t is a \Nhu- rewriting 
step), then P(t). 

Proof. Let us consider a wbu rewriting step 

s = C[Ja] o->7j, C[rW] = t. 

- If the rule used belongs to VC, 7 = & for some 6 G {0, 1} and r is a term over 
T' with only null marks; thus every .F'-path Q of t must either be included 
in C (case 1) or is obtained from a path P' of s by replacing its maximal 
element (labelled by b) by two elements (labelled by symbols of JF')(case 2). 
In case 1 Max{m(s/u) | u G Q} = because this was true in s. In case 2 
Max{m(s/u) | u G Q} ~ because the labels of Q' were null and the labels of 
the two new nodes are also null. Hence P is preserved; 

- If the special rule is used: since r has no label in T', every JF'-path Q oit 
must be either included in C (case 3) or included in x'a for some variable v of 
r. In case 3 Max{m(s/u) | u G Q} = because this was true in s and in case 4, 
Min{m(s/u) | u G P'} > 1. Hence P is preserved. □ 
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Lemma 4.47. For every Boolean formula tp, the system TZ^ is linear and 
BU-(l). 

Proof. It is clear that TZ^p is linear. Let us consider a wbu marked derivation 
(modulo n^): s o->" t o-> F with s € T{T). 

- li I ^ r ^ PC, since the step is wbu, the root of I has a null mark and no 
other mark appears in I since it has depth 0; 

- If / — > r is the special rule, since every branch of / is labelled by a word in 
J^'*V and t fulfills P, and I has a root marked 0, all internal nodes of I have the 
mark 0; hence mmax(7) = 0. 

By induction over the integer n we can thus prove that, for every s £ T{J-), 
every wbu marked derivation s o^" t is bu^(l). □ 

Let us prove Theorem 4.43. By Lemmas (4.44-4.45) Lp is satisfiable iff L{Q, Ti) 
L(^, T2). Moreover ^Z^p is computable in P-time from tp and, by Lemma 4.47, it 
belongs to BU~(1). Hence ^ TZ^ is a P-reduction of the satisfiability problem 
for Boolean formulas in 3-CNF to the problem under consideration. 

5 Testing the Bottom-up property 

We investigate here the question whether the properties BU(fc) (resp. BU~(fc)) 
are decidablc, or not. We concentrate first on the case of semi-Thue systems: 

• we establish a criterium (i.e. a Necessary and Sufficient Condition) for 
the property BU(fc), in the case of length-increasing semi-Thue systems 
(Proposition 5.6) 

• we show that the property BU(fc) is decidable in some non-trivial sub- 
class of length- increasing semi-Thue systems (Proposition 5.7) and that 
it is undecidable for general length-increasing semi-Thue systems (Theo- 
rem 5.11). 

• we deduce the undecidability of the property BU(A;) for term rewriting 
systems (Theorem 5.12). 

5.1 A criterium for semi-Thue systems 

More notation for derivations Let us borrow to [7, 20] some useful notation 
for dealing with derivations and, especially, equivalences over derivations. We 
assume some semi-Thue system TZ over an alphabet Y is given. For every rule 
R = I ^ r and words v,w € Y* , we note d^{{v, R, w)) := vrw, d~{(v, R, w)) := 
vlw. We call derivation any non-empty sequence of triples the form 

D = ((wi,i?i,Wi) . . . , {vi,Ri,Wi), . . . , {Vn,Rn,Wn)) (74) 

such that, for every 1 < i < n — l,d^{viRiWi) = (9^(u,;+ii?i+iU'i+i) and also 
the triples 

A, :=(w,ID,£) (75) 
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where ID is a special symbol that we view as the Identity rule. We extend the 
notations 9" by defining for D given in (74): 

d+iD) := Vnd+iRn)wn, d~ (D) vid-iRi)wi 

and for Z?„ given in (75): 

d+{D,) ■.= v, d~{D,) -V. 

We define an equivalence « on derivations by 

D^D' ^ (d+iD) = d+iD') & d-{D) = d-{D')) 

i.e. D, D' are equivalent when they have same starting word and same ending 
word. The length £{D) is defined as n for the derivation (74) and for the 
derivation (75). Given derivations D,D' such that d+{D) = d~{D'), their 
composition D ® D' is just their concatenation (when they both have non-null 
length), D when £{D') = 0, and D' when 1{D) = 0. 

The words oiY* act on the right and on the left over derivations: for D defined 
by (74) we set 

D - v := {{vi,Ri,wiv) . . . , (vi, Ri, Wiv), . . . , (u„, i?„, w„w)) 

and V ■ D is defined similarly; Du ■ v D^v and v ■ Z?„ :~ Dyu- 

One can easily check that, for every derivations D, D' , F, F' , words u,v and 

signs a 6 {+1, -1}: 

u{D (g) D')v = uDv uD'v, 
d°'{uDv) = ud°'{D)v, 
d+{D®D')^d+{D'), d-{D®D')=d-{D). 
From these formulas it follows easily that, 

{D K D' k F K F') ^ D ® F K D' ® F' 

D' ^ uDv w uD'v. 

These two last compatibility properties will be widely (though implicitly) used 
in our proofs. We call a derivation D right- minimal iff its only decomposition 
as D ^ D' ■ v' is the trivial one: D' = D, v' = e. 

Some basic properties Let TZ be any semi-Thue system over some alphabet 
Y. 

Lemma 5.1. For every derivation D and word u G Y* , D is bu(fc) iff D ■ u is 
hu{k). 

Sketch of proof. <J= is clear. 

=^: In the marked derivation, w.r.t TZ (see §2.5), associated with D, the letter 
# has marks in [0, k]. In the marked derivation (w.r.t TV) associated with D ■ u, 
all the positions of the suffix u# will have the same mark which is the same 
integer as before, hence belongs to [0, k]. □ 
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Lemma 5.2. For every derivations Di,D2, if Di D2 is bu(A:), then Di and 
D2 are bu(fc). 

Sketch of proof. Suppose that Di (E> D2 is bu(fc). Let us set i{Di) — £1, £{02) = 
The fact that Di is bu(A:) too is straightforward. 

For every i g [0, £2] , the mark of the jth position of the ith word of D2 is smaller 
than the mark of the jth position of the {£1 + i)th word of Di® D2. Thus, the 
hypothesis ensures that all the marks of the marked derivation associated with 
D2 are in [0, k]. □ 

Lemma 5.3. For every derivations Di,D2, word ui G Y* and rule Ri G TZ, if 
Di ®ui ■ Ri is bu(fc) and D2 is bu(fc), then Di (g) ui ■ Ri ® D2 is bu(fc). 

Sketch of proof. The derivation Di ® ui • Ri is bu(fc), hence weakly bottom- 
up. Hence all the positions of ui in the Ihs and rhs of the last step of this 
derivation have a null mark. All the positions of d^{Ri) in the rhs of the last 
step also have a null mark, by definition of the marking process. Finally, the 
word d^{Di ui ■ Ri) has only null marks in the marked derivation (w.r.t. TZ) 
associated with Di (g ui ■ Ri. Let us consider the unique marked derivation D 
(w.r.t. TV) associated with the derivation Di ® ui ■ Ri ® D2: 

i3:wo#,u;i#"\...,iD„#™" 

- its part labelled over Y , {wq, iDi, . . . , Wn) is obtained just by concatenating the 
corresponding marked derivation associated with Di ui • Ri and the marked 
derivation associated with D2 (where the final letter # has been erased); in 
particular this proves that every step fulfils definition 3.7, hence that D is wbu; 
it proves also that all the marks of the words iuo,ij}i, . . . ,Wn belong to [0, k]; 

- every mark nip, for < p < ^1, where £1 := £{Di (g) ui • i?i), belongs to [0, k], 
because Di (gui ■ i?i is bu(fc); 

- one can prove by induction over p, for £1 < p < n, that every mark rup belongs 
to [0, A;]: m^^ G [0, fc] and, if p > £1, rup is the maximum of TOp_i and of the 
mark on the # in the corresponding word of the marked derivation (w.r.t. TZ) 
associated with D2. 

Hence D is bu(fc), which entails that Di (g) ui ■ Ri (g) D2 is bu(fc). □ 
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Definition 5.4 (Minimal right-overlap). Let k > 1 and let TZ be a semi-Thue 
system over some alphabet Y. Let us call minimal fc-right-overlap a 7-tuple 
{D, R, R' ,u,v,u' ,w) such that D is a r-minimal derivation, R,R' are rules of 
TZ and u, u, u', w are words in Y* , fulfilling: 

1- d+{D) =d-{uRv), 

2- d+{uRvw) = d~{u'R'), 

3- uRv is BU{k) 

^- < \w\ < \vw\ < \d-{R')\. 

The minimal k -right- overlap is said resolved iff there exists a bu(fe) derivation 
D' « Dw (g) uRvw (g) u'R'. 

(See Figure 14). 

Lemma 5.5. Let k > 1 and let TZ be a length-increasing semi-Thue system over 
some alphabet Y. Let us suppose that all the minimal k-right-overlaps ofTZ are 
resolved. 

Then, for every bu(fc) derivation D, rule R' £ TZ and words u',v' G Y* , if 



then 



d+{D) =d-{u'R'v' 



there exists a bu(fc) — derivation D' D ® u'R'v' . 



(76) 
(77) 
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Figure 15: Adding one rule 
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Figure 16: Adding one rule 



Proof. We prove that every {D, R' ,u' ,v') fulfills the implication ((76) ^ (77)), 
by Noetherian induction over the pair (|w'|,£(D)), using the lexicographic or- 
dering on N X N. 
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Let us consider some {D, R', u' , v') such that D is bu(fc) and the hypothesis 
(76) holds. One of cases — 3 below must occur (see Figures 15-16). 
case 0: 1{D) = 

In this case we can choose D' = D ® u'R'v' = u'R'v' which is bu(fc). 

case 1: £{D) > 1, D = DiVi where Di is a right-minimal bu(fc)-dcrivation and 

|5+(i^i)| < \d~{u'R')\ 

Since Di has non-null length, is right-minimal and bu(fc). it decomposes as 

Di= D'^®uiRi®D'^ (78) 

and the assumed inequality on the boundaries implies that v' = v"vi for some 
word v". The tuple (D'^, R' ,u' ,v") fulfills hypothesis (76) and {\u'\,£{D'^)) = 
{\u'\,£{D) — 1) < {\u'\,i{D)). By induction hypothesis, there exists some bu(fc)- 
derivation D3 « D2 (g) u'R'v" . Let us choose 

D' := {D'2(^uiRi®D3)vi. 

Lemma 5.2 applied on decomposition (78) shows that D'2 (E) uiRi is bu(fc). 
By Lemma 5.3 D'2 ® uiRi ® D^, is bu(fc) and by Lemma 5.1 wc get that 
{D'2 (E> uiRi ig) D3)vi is bu(fc). Hence D' is a bu(fc) derivation such that D' « D. 
case 2. D = (Di uRv)wv' where Di is a right-minimal bu(fc)-derivation and 
< |w| < \vw\ < \d-{R')\ . 

The 7-tuple (Z?i, i?, i?', u, w, w', w) is a minimal /c-right-overlap, hence it is re- 
solved: there exists a bu(fc) derivation D2 ~ Diw uRvw (8) u'R'. Choosing 
D' D2V' we obtain the conclusion (77). 

case 3: D = {Di uRv)w where Di is a right-minimal bu(fc)-derivation and 
\ud-iR)\ < \u'\ . 

The inequality on the boundaries implies that "i? and R' can be exchanged" 
i.e. that there exists words a,(3 G Y* such that uRvw u'R'v' ?» aR'v' uRp. 
Thus 

{Di (g, uRv)w ® u'R'v' K. Diw ® aR'v' ® uRl3. (79) 

Note that, as every rule of TZ is length-increasing, \a\ < \u'\. The 4-tuple 
{Diw,R',a,v') fulfills hypothesis (76), and {\a\,e{Diw)) < {\u'\,£{D)). By 
induction hypothesis there exists a bu(fc)-derivation 

D2~ Diwf^aR'v' . (80) 

The 4-tuple {D2,R,u,(3) fulfills hypothesis (76), and {\u\,t{D2)) < {\u'\,£{D)) 
(because |u| < |w'|). By induction hypothesis there exists a bu(fc)-derivation 

D' KD2® uR(3. (81) 

Combining the hypothesis of case 3 with equivalences (79) (80) (81) we obtain 
that D' Ki D ® u'R'v' , as required. In all cases wc have proved the announced 
implication: 

((76) ^ (77)) 

□ 
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Proposition 5.6. Let fc > 1 and let TZ be a length-increasing semi-Thue system 
over some alphabet Y . The system TZ is BU(fc) iff all its minimal k-right-overlaps 
are resolved. 

Proof. (^) 

Suppose that 7?. is a semi-Thue system over some alphabet Y and that it is 
BU(fc). Since every derivation must be equivalent to some bu(fc) derivation, it 
is clear that every minimal fc-right-overlap is resolved. 
(^) 

Let TZ have all its minimal fc-right-ovcrlaps resolved. Let us prove, by induction 
over i{D), that, for every derivation D, there exists some bu(fc) derivation D' « 
D. 

Basis: £{D) = 0. 

In this case D is bu(fc), hence we can choose D' = D. 
Induction step: £{D) = n + 1 for some n > 0. 
D has a decomposition of the form: 

D = Di® u'R'v' 

for some derivation Di, with length i{Di) = n, some rule R' £ TZ and some 
words u' ^v' G Y* . By induction hypothesis, there exists some bu(fc)-dcrivation 
D'l such that 

By Lemma 5.5, there exists some bu(fc)-derivation D' such that 

D' « D[® u'R'v'. 

It follows that D Ki D', as required. □ 

5.2 Decidable/undecidable cases for semi-Thue systems 
A decidable case 

Proposition 5.7. Let k>l. The property BU(fc) (resp. BU^{k)) is decidable 
for length-increasing semi-Thue systems fulfilling the additional condition below: 
C: TZ has no right-linear recursion i.e. there is no finite sequence of rules 
[li — i- r,;)i<i<„ such that, for every 1 < i < n — 1, ri is a suffix of k+i, n > 2 
and li — In. 

Proof, (sketch) 

Let TZhe a, length-increasing semi-Thue system fulfilling condition C. By propo- 
sition 5.6 a necessary and sufficient condition for TZ to be BU(fc) is that all its 
minimal fc-right-overlaps are resolved. By condition C this set of minimal k- 
right-overlaps is finite and constructible. Hence the above necessary and suffi- 
cient condition is testable. □ 
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Undecidable cases We treat first the case of tlie property BU(1). 

Proposition 5.8. It is undecidable whether a finite length-increasing semi-Thue 
systemU is BU(1) (resp. BU"(1);. 

Our proof will use the following variant of the universality problem for 
context-free grammars: 

Input: A context-free grammar G = {Z,N,P) where Z = {zi,Z2} is the ter- 
minal alphabet, n is a strictly positive integer. N = {Si, . . . ,Sn} is the non- 
terminal alphabet and P C N x V{{Z U iV)+) is the finite set of rules. 
Question: Vs e Z*,Si -^*p sSil 

We call this problem the Modified Universality Problem (MUP in short). It 
follows easily from the undecidability of the classical universality problem for 
context-free grammars ([16, Theorem 8.11 p. 203]) that the above problem is 
undecidable. 

Let us consider an instance G of MUP. We introduce some fresh symbols 
So, ai, 6 not in Z U and define the alphabet Yi := Z U TV U {S'o, ai, 6}. Let 7^ 
be the semi-Thue system over Yi whose set of rules consists of the union of P 
with the three new rules: 

•S'o ziSq, Sq — > 22<S'oj Soai Siai. 

We call Rzi, Rz2, Roi (in the above enumeration order) these new rules. We 
decompose in two lemmas the proof that G 7^ is a reduction of M U P to the 
problem whether a semi-Thue system is BU(1). 

Lemma 5.9. IfTZ is BU(1), then, for every s G Z*,Si -^*p sSi. 

Proof. Suppose TZ is BU(1). Let s G Z*. Consider the following derivation: 

Soaib -^"^ sSoQib -asmg Rzi, Rz2 
— >K sSiQib using Rqi. 

The only possible bu(l) derivation with same boundary in the system TZ would 
be the composition of the first step 

Soaib ->-R. Siaib using i?oi 

with a derivation 

Si^psSi (82) 
in the right-context aib. The existence of derivation (82) is thus ensured. □ 

Lemma 5.10. If for every s E Z* , Si -^*p sSi, then TZ is BU(1). 
Proof. Let us suppose that 

ys e Z*,Si sSi. (83) 
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Let us consider some minimal 1-right-overlap {D, R, R' , u, v, u', w) of the system 
TZ. The only possible value for R' is i?oi while R might be either R^i or i?z2- 
It follows that w = ai and v = e. Since D (g) uRv is bu(l), it has the form 

D ® uRv : Sq -^*p sSq 

for some s G Z+. Hence 

Dw uRvw ® u'R' : 5*001 -^p sSqcii ^Roi sSiai. 

By hypothesis (83) there exists also a derivation Di of the form Di : 5i sSi. 
Let us choose 

D' := Roiai ® Di. 

Since D' : SqUi sSiai is bu(l), the only minimal right-overlap is resolved. 
By Proposition 5.6, it follows that TZ is BU(1). □ 

Let us prove now Proposition 5.8. 

Proof. By Lemma 5.9 and Lemma 5.10, G TZis a many-one reduction of MUP 
to the problem whether a finite length- increasing scmi-Thue system is BU(1) or 
not. This last problem is thus undccidablc. □ 

We treat now the case of BU(fc) for an arbitrary fc > 1. 

Theorem 5.11. For every k > \ , it is undecidahle whether a finite length- 
increasing semi-Thue systemTZ is BU(fc) (resp. BU~(fc)j or not. 

(Note that for semi-Thue systems, since every variable of a Ihs of rule must 
appear in the corresponding rhs, the properties BU(fc), BU~(fc) are equivalent). 

Proof, (sketch) Let fc > 1. Given an instance G of MUP, we construct an 
alphabet := 2'UA^U{S'o,i, . . . , S'o.fe, fli, 02, . . . , a^, 6}, and a semi-Thue system 
TZ over consisting of the union of P with the two new rules : 

^O,! ^I'S'o,!, 5*0,1 Z2So,l, 

and the fc additional rules: 

5o4ai — > 5o,2ai, 5'o.2aia2 ^ 5o.3aia2, ■ • ■ , 5'o,fcaia2 Siaia2 

One can check, by arguments similar to those used in the proof of Proposi- 
tion 5.8 that TZ is BU(fc) iff Ws e Z*, Si ~^*p sSi. Hence the property BU(fc) is 
undecidable. □ 



5.3 Undecidability for term rewriting systems 

Theorem 5.12. For every k > 1, the problem to determine whether a finite 
linear term rewriting system {TZ,J-) is BU(fc) (resp. BU~(fc)j or not, is unde- 
cidahle. 

Proof. For every finite semi-Thue system TZ the corresponding term rewriting 
system TZ (defined in §2.5) is finite and linear. Moreover, by point 2 of definition 
3.19, TZ is BU(fc) iff TZ is BU(fc). Hence this theorem is a straightforward corollary 
of Theorem 5.11. □ 
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6 Strongly Bottom-up systems 



Since the BU(fc) conditions arc, as such, undccidablc (Theorem 5.12), we are lead 
to define some stronger but decidable conditions. We study in §6.1 the strongly 
bottom-up ( SBU for short) restriction. We introduce in §6.1 a technical tool 
that will be used in §6.3 and §6.4 for giving a polynomially decidable condition 
implying condition SBU. 

6.1 Strongly bottom-up systems 

We abbreviate strongly bottom-up to sbu. 
Definition 6.1. A system {Tl,T) is said SBU(fc) iff 

for every derivation D : s — >^ t, from a term s G T{J-') to a term t G T(T), 

D is wbu <^ D is bu(fc). 

We denote by SBU(fc) the class o/SBU(fc) systems and by SBU = |Jj,gf^ SBU(fc) 
the class of strongly bottom-up systems. 

In other words: instead of requiring that the binary relations —>-^ and k^^i 
over 'T[J-) are equal, we require that all wbu marked derivations starting on an 
unmarked term use only marks smaller or equal to k. The following lemma is 
obvious. 

Lemma 6.2. Every SBU (A;) system is BU(fc). 

This stronger condition over term rewriting systems is interesting because 
of the following property. 

Proposition 6.3. For every k > 0, it is decidable whether a finite term rewrit- 
ing system {TZ,J-) is SBU(fc). 

Proof. Note that every marked derivation starting from some s G Ti^) and 
leading to some t g T(jr^) \ T{T-'') must decompose as 

s k+io—'Tz s' t, 

with s' G T(J^-'^+^) \ T{T-'^). A necessary and sufficient condition for TZ to be 
SBU(fc) is thus that: 

( k+io^*n)[r{T^''+') \ r(j-^^)]) n t{:f) = 0. (84) 

By Theorem 4.2 the left-handside of equality (84) is a recognizable set for which 
we can construct a f.t.a ; wc then just have to test whether this f.t.a recognizes 
the empty set or not. □ 

According to the results of [19] it seems likely that the property [3k > such 
that {TZ,J-) is SBU(A:) ] is undecidable for term rewriting systems. It is then 
interesting to look for a decidable sufficient condition. Our condition is based 
on a finite graph that we define in next subsection. 
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6.2 The sticking-out graph SG(7^) 

Let us associate with every Term Rewriting System a graph whose vertices are 
the rules of the system and whose arcs {R, R') express some kind of overlap 
between the right handside of R and the left handside of R' . Every arc has a 
label indicating the category of overlap that occurs and a weight which is an 
integer (0 or 1). The intuitive meaning of the weight is that any derivation step 
using the corresponding overlap would increase some mark by this weight. The 
precise graph is defined below and is directly inspired by the one of [30] , though 
slightly different. 

Definition 6.4. Let s e T{T,V), t e T{T,V) \ V and w e Posvit). We say 
that s sticks out of t at if 

1. Vt; G ■Pos(t) s.t. e ^ V -< w, V E 'Pos(s) and s{v) = t{v). 

2. w € Pos(s) and s/w <^T{T). 

If in addition s/w ^ V then s strictly sticks out oft at w. 

Definition 6.5. Let TZ = {h — > ri, . . . , — > r„} be a system. The sticking-out 

graph is the directed graph SG(7?.) — (V,E) where V = and E is 
defined as follows: 

a) if Ij strictly sticks out of a subterm of ri at w, i^-^ j € E; 

b) if a strict subterm of Ij strictly sticks out of ri at w, i^—ljE E; 

c) if a subterm of ri sticks out of Ij at w, i ^ j E E; 

d) if ri sticks out of a strict subterm of Ij at w, i^--> j E E. 

Figure 17 shows all the possibilities in the four categories (a), (b), (c), (d). 

Example 6.6. The graph of the system TZq = {f(f (x)) H^)} contains one 
vertex and two loops labeled (d) and (a). 

It can be shown with an ad hoc proof that TZq E BU (actually in BU (1)/ We 
have already seen in Example 3.18 that TZo ^ SBU. 

Example 6.7. The graph of system TZi = {g(f(g(a:))) ^{x)} contains one 
vertex and a simple loop labeled (b). TZ^ is not inverse recognizability preserving: 

HkJ[{/(«)}] = {5"(./(5"(«)))I«>0}, 
which is not recognizable. 

The weight of each arc of SG(7?.) is defined by: 

• arcs (a) or [b) have weight 1. 

• arcs (c) or (rf) have weight 0. 

The weight of a path in the graph is the sum of the weights of its arcs. The 
weight of a graph is the maximal weight of a path in the graph; it is infinite if 
the graph contains a cycle with an arc of weight 1. 

The sticking-out of TZi of Example 3.1 is given in Figure 18. 



56 




Figure 18: The sticking-out graph of TZi 



6.3 A sufficient condition for semi-Thue systems 

Let us fix a semi-Thue system TZ over an alphabet Y. The main result of this 
subsection is that, if every path of SG(7^) has a weight < k, then TZ has the 
property SBU(fc + 1). We prove some lemmas establishing some links between 
wbu derivations, on one hand, and paths of SG(7^), on the other hand. Again, 
we use the notation defined in §5.1 for manipulating derivations. 
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Figure 19: Downwards derivation 



Lemma 6.8. (Downwards derivations) 

Let R, R' G TZ, u, v, u' G Y* and D a derivation such that uRv ® D ® u' R' is a 
\Nhu- derivation. Then, there exists a path from R to R' in SG{TZ). 
(See figure 19). 
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Proof. Let us consider the following property P{n): 

for every wbu-derivation (uiRiVi)o<i<n+i for the system TZ, if |wn+i| = then 

there exists a path from Rq to Rn+i in SG(7?.). 

We show by induction over n that, for every n G N, P{n) holds. 

Basis: n — 0. 

We thus have |uo| + |9+(i?o)| + |Mo| = {Ri)\ + \ui\. Since this derivation is wbu 
we also have \vo\ < \d~{Ri)\ or |i;o| = \9^{Ri)\ = 0. From these inequalities it 
follows that (i?o, ^i) is an edge of SG{TZ). 
Induction step: n > 1 
We define 

i ■=iam{j e I \vj \ < \vo\ + \d+{Ro)\}. 

Since the given derivation is wbu, \vq\ < \vi\ + \d^{Ri)\ or (|uo| — and 
\d~{Ri)\ = 0). Hence {Ro,Ri) is an edge of SG(Jl) and {ujRjVj)i<j<n+i is 
a wbu-derivation fulfilling |wn+i| = 0. By induction hypothesis, there exists a 
path p from Ri to Rn+i in SG(7^). The edge (i?o, Ri) followed by the path p is 
a path from Rq to Rn+i in SG(7?,). 

Let R, R', u, V, u', D fulfill the hypothesis of the lemma. Let us note: 

■uo := It, i?o := R, Wo := D := (uji?iWi)i<,<„, := u' , Rn+i := R' , w„+i := e. 

Applying P{n) to the derivation (Mii?iWi)o<i<n+i; we obtain the conclusion of 
the lemma. □ 

Lemma 6.9. (Strict Downwards derivations) 

Let R, R' G TZ, u, v, u' G Y* and D a derivation such that uRv ® D ® u' R' is a 
\Nhu- derivation and \v\ > 1. Then, there exists a path with non-null weight from 
R to R' in SG{n). 

Proof. Let us consider the following property Q{n): 

for every wbu-derivation {uiRiVi)o<i<n+i for the system 7?., if |wo| > 1 and 
|f„+i| = 0, then there exists a path with non-null weight from _Ro to Rn+i in 
SG(7^). 

We show by induction over n that, for every n G N, Q{n) holds. 
Basis: n = 0. 

(5(0): we thus have |uo| + |9^(-Ro)| + |uo| = {Ri)\ + \n.i\. Since this derivation 
is wbu we also have \vq\ < From these inequalities it follows that 

(i?o, Ri) is an edge of type (a) or (b) of SG(7^). 
Induction step: n > 1. 
We define 

i := mm{j G [l,n+ 1] | \vj \ < \vo\ + |a+(i?o)|}- 
case 1: \vi\ > \vo\. 

In this case {Rq, Ri) is an edge of SG(7^) and {ujRjVj)i<j<„+i is a wbu-derivation 
fulfilling > 1 and |wri-Hi| = 0. Hence, by induction hypothesis, there exists 
a path p from Ri to i?„+i, with non-null weight, in SG(7^). The edge {Ro,Ri) 
followed by the path p is a path with non-null weight from Rq to Rn+i. 
case 2: \vi\ < \vo\. 
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In this case, since the given derivation is wbu, \vq\ < \vi\ + \d~{Ri)\. Hence 
{Ro,Ri) is an edge of weight 1 of SG(7^). By lemma 6.8 there exists a path p 
from Ri to i?„+i in SG{TZ). We can conclude as in case 1. 

From Q{n) wc can deduce the lemma. □ 
Lemma 6.10. (History of a mark) 

Let D be some marked \Nb\i- derivation and let y G Y^wi,W2 G Y* such that 
d~{D) is unmarked, d^{D) = Wiyw2 and the mark of y in the corresponding 
marked word is k > 0. Then, there exist u, w G Y* , R £ TZ and some derivations 
D', D" such that 

1- D ^ D' ® uRvyw2 ® D"yw2 

2- the mark of y in every step of D"yw2 is k 

3- the mark of y in d^{D') is < k. (See figure 20) 
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Figure 20: History of a mark 

Proof. Let us remark that every derivation D fulfilling the hypothesis of the 
lemma must have a length £{D) = n + 1 for some integer n > (since its result 
d^{D) has some non-null mark). We prove the lemma by induction on this 
integer n. 
Basis: n = 0. 

Thus D = viRv2 for some ui, W2 G Y* , R £ TZ. Since the given occurrence of y 
has a non-null mark, it must be a position of V2. It follows that D = uRvyw2 
for some words u,v E Y* . Let us define: 

D' ■— Dud- (H)vyW2J D" :— Dyj^. 

These derivations fulfill conclusions (1-3) of the lemma. 
Induction step: n > 1. 

By the same arguments, D — E ® uRvyw2 for some u,v £ Y* , R £ TZ and some 
derivation E of length n. 

Case 1: The mark of y in d^{E) = ud~{R)vyw2 is k. 

By induction hypothesis E has some decomposition as E ~ E' ® u'R'v'yw2 <8) 



59 



E"yw2 such that the mark of y in every step of E"yw2 is k 

and the mark of y in d+[E') is < k. Taking D' := E' and D" := E" ® uRv, the 

conclusion of the lemma is fulfilled. 

Case 2: The mark of y in d^{E) = ud~ (R)vyw2 is < k. 

Taking D' := E and D" := -D„9+(_r)i;7 the conclusion of the lemma is fulfilled. 

□ 

Let 7?. be a semi-Thue system and /c G N. We consider the following property 
PATH(fc): for every vi,'W2 ^ Y* , R' E TZ and wbu-derivations D, E such that 

E = D®viR'w2 & m{\zst{vid- {R'))) = k (85) 

there exists a path in SG(7^) with weight > k and with extremity R' . 

Lemma 6.11. Let TZ be a semi-Thue system. For every k G N, the property 
PATH(fc) holds. 
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= k 


= k 



Figure 21: From marks (in derivations) to weights (in paths) 

Proof. We prove by induction over G N the statement 

Vfc G N, PATH(fc). 

Basis: fc = 

There exists a path of length 0, thus of weight > 0, in SG(7?.), with extremity 
R'. 

Induction step: fc > 1 

Let us assume (85). Applying lemma 6.10 to the derivation D, to the let- 
ter y = Iast(z;i9~(i?')) and to the words :— vid^ {R')y~^ , W2, wc obtain 
u,v,R,D\D" such that: 

E = D' ® uRvyw2 ® D"yw2 viR'w2, 
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the mark of y in every step of D"yw2 is k and the mark of y in d^{D') is < k 
(see figure 21). By the definition of a marked rewriting-step, we must have: 

k = M{ul,x) 

where R ~ I ^ r, x is the variable of I and u is the marked word corresponding 
to the context where R is apphcd. Let us consider E' := D' ^uRvyw2- It fulfiUs 

E' = D'®uRvyw2 & m(last(wa"(i?))) = - 1 

By induction hypothesis, there exists a path p in SQ{TZ) with weight > — 1 
and with extremity i?; by lemma 6.9, there exists a path q with non-null weight 
from R to R' in SQ{TZ). The concatenation p ■ qis & path with weight > fc in 
SG(7^). □ 

Proposition 6.12. Let TZ be a semi-Thue system and k > 1. IfW{SG{Tl)) = 
k - 1 then TZ e SBU(/j). 

Proof. Suppose that TZ ^ SBU(fc). This means that some wbu-derivation (w.r.t. 
TZ) starting from a non-marked (unary) term over Y U {#} reaches a marked 
term with the mark k + 1. Let E be a wbu derivation (w.r.t. TZ) with minimal 
length reaching the mark k + l.Let us consider the derivation E (w.r.t. TZ) 
corresponding to E (it is obtained from E just by erasing all occurrences of the 
nuUary symbol #). This derivation E must have a decomposition of the form 
(85). By lemma 6.11 PATH(fc) holds, hence there exists a path in SG(7^) with 
weight > k. By contraposition, if VF(SG(7^)) < - 1 then TZ E SBU(fc), which 
proves the proposition. □ 



6.4 A sufficient condition for term rewriting systems 

Proposition 6.13. Let TZ be a linear system and k > 1. If W{SG{TZ)) = fc — 1 
then TZ G SBU(fc). 

Proof. (Sketch) Let us associate to TZ the scmi-Thuc system T corresponding 
to the "branch-rewriting" induced by TZ: it consists of all the rules 

such that there exists a rule I r E TZ, and a variable x E V , such that ux 
labels a branch of I and vx labels a branch of r. Suppose that the mark fc + 1 
appears in a 7?.-dcrivation. Since the marking-mechanism is defined branch by 
branch, the mark fc + 1 also appears in a T-derivation. By Proposition 6.12, 
there exists a path in SG(T) with weight > fc. Let us fix some total ordering on 
TZ and define the map h : T TZhy. 

h{u v) = I ^ r 

iff / — > r is the smallest rule of TZ such that ux (resp. vx) labels a branch of 
I (resp. r) and a; is a variable. This map h is an homomorphism of labelled 
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graphs from SG(T) to SG(7?.), i.e. it is compatible with the labels. It follows 
that it is also compatible with the weights. Hence there exists a path of weight 
> A: in SG{n). □ 



Corollary 6.14. Let TZ be a linear system. If W{SG{TZj) is finite then TZ € 
SBU. 

Proposition 6.15. LFPO^ C SBU. 

Proof. Let TZ G LFPO^^. By definition the sticking-out graph of [30] does not 
contain a cycle of weight 1, hence from corollary 6.14, TZ G SBU. So LFPO^^ C 
SBU. 7^o e SBU but 7^o ^ LFPO"\ We conclude that LFPO"^ C SBU. □ 

Example 6.16. Let TZ5 = {f(g(a;),a) ^ f(x,b)}. TZ5 ^ LFPO"^ as SG(7^) 

contains a loop (a) so a loop of weight 1. It is easy to show by an ad-hoc proof 
that TZq G SBU (1). However our sufficient condition is not able to capture TZ5. 

Corollary 6.17. LFPO"^ C SBU C BU. 

7 Perspectives 

Here are some natural perspectives of development for this work: 

1. it is tempting to extend the notion of bottom-up rewriting (resp. system) 
to left-linear but non right-linear systems. This class would extend the 
class of growing systems studied in [23]. 

2. a dual notion of top-down rewriting and a corresponding class of top-down 
systems should be defined. This class would presumably extend the class 
of Layered Transducing systems defined in [27]. 

3. we know that the condition BU(fc) is undecidable (for every k > 1) and 
that the condition SBU(fc) is decidable (for every k > 1); whether the 
condition SBU is decidable is thus a natural question. 

4. the systems considered in [14] and the systems considered here might be 
treated in a unified manner; such a unified approach should lead to an even 
larger class of rewriting systems with still good algorithmic properties. 

Some work in directions 1,2 has been undertaken by the authors. 
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